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(57) Abstract: This invention provides 

, — methods for producing plants with 

ri}S^rti592m! riJS^r^ri! increased seed and biomass production. 

SSSS More specifically, this invention provides 

Pm P-< Ph Pm P^ Pm Mh Ph methods for producing plants which 

have increased yields for a number of 
plant traits, including seed number, seed 
weight, the number of seed heads, flag 
leaf weight and total plant weight. This 
invention also provides methods for 
improving the Harvest Index of a plant. 
In a preferred embodiment, the methods 
comprise introducing into a plant a 
nucleic acid wherein the nucleic acid is 
selected from the group consisting of a 
nucleic acid comprising SEQ ID NO: 3, 
a nucleic acid which hybridizes with SEQ 

ID NO: 3 under high stringency conditions and encodes a polypeptide that retains biological activity of SH2-REV6-HS, a fragment 
of SEQ ID NO: 3 encoding a peptide that retains biological activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide 
comprising SEQ ID NO: A, or a fragment thereof that retains biological activity of SH2-REV6-HS, and a nucleic acid encoding an 
SH2HS or an SH2RTS polypeptide. The present invention also concerns plants obtained by the methods provided herein. 




Shrunken! 
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DPSCRJPTOM 

TRANSGENIC PLAN TS WTTH INCREASED SEED YIELD. 
BIQMASS AND HARVEST INPEX 

Field of the Invention 
This invention relates to improving plant production, both plant seed production 
and plant biomass production. More specifically, this invention relates to transgenic 
plants which have increased seed production and increased biomass production when 
compared to non-transgenic plants of the same genetic background. Even more 
specifically, this invention relates to plants which are transgenic for Sh2'Rev6-HS and to 
methods for producing such plants. 

Background of the Invention 
ADP glucose pyrophosphorylase (AGP) is one of the primary enzymes involved 
in the biosynthesis of starch and glycogen in organisms such as plants, algae, fungi and 
bacteria, particularly plants. AGP catalyzes the following reaction: 

a-glucose- 1 -P+ ATP ADP-glucose+PPi . 

ADP-glucose, the product of the above reaction, is the major donor of glucose in the 
biosynthesis of starch in plants and in the biosynthesis of glycogen by bacteria. 

AGP is widely distributed throughout the plant kingdom. It is present in 
monocots such as wheat, rice, barley, and maize, as well as dicots such as spinach, potato, 
and pea. It is also found in some starch producing bacteria, such as E. coli. Plant AGP 
exists as a tetramer (210 to 240 kDa) composed of two small sub-units (50 to 55 kDa) 
and two large sub-units (5 1 to 60 kDa) in contrast to bacterial AGP which appears to 
consist of four units of equal size. AGP has also been shown to be produced in 
cyanobacteria and in algae, where its tetrameric structure is similar to that in plants, i.e. 
two large and two small sub-units, rather than the homotetrameric structure found in 
ordinary bacteria. 
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Because of the commercial importance of starch, primarily as a foodstuff but also 
as an important industrial chemical, much work has been done to isolate and characterize 
the nucleic acid encoding AGP. Plant AGP consists of two different protein subunits. 
In maize endosperm, AGP is encoded by the Shrunken-2 {Sh2) and Brittle-2 (5/2) genes 
5 (Bhave et a/., 1990 and Bae ei ai, 1990). Sh2 encodes the large subunit having a 
predicted molecular weight of 57,179 Da, while Bt2 encodes the small subunit having a 
molecular weight of 52,224 Da. The isolation of nucleic acids encoding AGPs from 
various other plants has also been reported: the small subunit cDNA (Anderson et al, 
1989) and the genomic DNA (Anderson et ai, 1991) from rice; the small and large 
10 subunit cDNAs from spinach leaf (Morell et ai, 1988); and the small and large subunit 
cDNAs from potato tuber (Muller-Rober et al, 1990; and Nakata et al, 1991). 

Moreover, work has been done to alter AGP expression in plants in order to 
regulate starch synthesis. EP 455,316 provides a plasmid that comprises a DNA 
encoding AGP placed in an inverted orientation, which results in the transcription of the 
1 5 anti-sense mRN A in a host plant. The patent shows that transgenic potatoes comprising 

the plasmid has reduced AGP activity and reduced starch concentration compared to non- 
transformed plants. U.S. Patent No. 5,773,693 discloses a method of increasing sucrose 
content of pea plant by suppressing or reducing the expression of either or both subunits 
of AGP. The method comprises transforming a pea plant with a plasmid comprising 
20 nucleic acid encoding the Sh2 subunit or the Bi2 or both subunits in antisense orientation 

to the promoter and the terminator. 

In contrast, U.S. Patent No. 5,977,437 teaches a method of increasing the rate and 
/or yield of starch production in a plant comprising introducing into a plant, a nucleic acid 
encoding barley endosperm AGP operably linked to a plastid transit peptide. EP 634,491 
25 discloses a method of decreasing oil content in seed by increasing the amoxmt of starch 
comprising transforming a plant cell with a nucleic acid comprising a promoter, and a 
DNA encoding a fiision protein comprising an amino terminal plastid transit peptide, an 
AGP enzyme, and a 3' non-translated transcription termination sequence, obtaining 
transformed plant cells, and regenerating transformed plants from the transformed plant 
30 cells. Finally, U.S. Patent No. 5,792,290 discloses the nucleic acid encoding wheat AGP 
and teaches inserting extra copies of the AGP gene into a plant genome by transformation 
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to enhance starch production and inserting the complement of the mKNA encoding the 
endogenous AGP to reduce starch production. 

The maize endosperm is the site of most starch deposition durmg kernel 
development. Sh2 an&Bt2 maize endospenn mutants have greatly reduced starch levels 
5 corresponding to deficient levels of AGP activity. Mutations of either gene have been 
shown to reduce AGP activity by about 95% (Tsai et ai, 1966; Dickinson et al, 1969). 
Lack of AGP and a decrease in starch levels compared to that of the wild-type endosperm 
result in shrunken, brittle, and/or collapsed kernels at seed maturity. Furthermore, it has 
been observed that enzymatic activities increase with the dosage of functional wild-type 
10 Sh2 and Bt2 alleles, whereas mutant enzymes have altered kinetic properties. 

AGP is the rate limiting step in starch biosynthesis in plants. Stark et al placed 
a mutant form of E. coli AGP in potato tuber and obtained a 35% increase in starch 
content (Stark et al, 1992). AGP is an allosteric enzyme, Le. its activity is regulated 
through the binding of an effector to an allosteric site. In plants, the positive effector of 
1 5 AGP is 3-phosphoglycerate (3-PGA), and the negative effector is phosphate (Dickinson 
et al, 1969). Inhibition of AGP by phosphate is likely the largest limitation on starch 
biosynthesis in plants (Giroux et al, 1996). 

Giroux et al (1996; U.S. Patent Nos. 5,872,216 and 5,589,618, each of which 
is hereby incorporated by reference in their entireties) used in vivo, site-specific 
20 mutagenesis to create short insertion mutations in a region of the gene known to be 
involved in the allosteric regulation of AGP. Single mutations of the Sh2 gene containing 
an insertion of an additional tyrosine or serine residue reduced total AGP activity and the 
amount of SH2 protein. A specific revertant containing an additional tyrosine residue 
and an additional serine residues increased seed weight 11-1 8%. This later revertant were 
25 named ''Sh2-mlRevff'' (this gene is designated as '"Sh2-Revff' herein). Giroux et al 
(1996) also found that the increase in seed weight of the Sh2'mlRev6 was not solely 
attributable to an increase in starch content, although there was an increase in the absolute 
starch content in the variant expressing Sh2-mlRev6. Giroux et al (1996) suggested that 
the enhanced starch synthesis caused by Rev6 creates a stronger sink within the seed 
30 leading to increased synthesis of other seed components. Mutations in AGP conferring 
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increased heat stability to a plant expressing the mutant AGP are disclosed in U.S. Patent 
No. 6,069,300 and published PCT application WO 99/58698. 

Modulation of the sink strength of a plant is one of the methods to increase 
harvest yield. The leaves and other green tissue active in photosynthesis are commonly 
5 referred to as the "source'*, and those parts in which storage occurs are referred to as the 

"sink". In cereals such as maize, rice, and wheat, the primary sink is the endosperm, and 
individual seed weight is the primary determinant of the yield of com (Duvick et al, 
1992). As evidenced by Giroux et al (1996), rendering the maize endosperm AGP 
insensitive to phosphate inhibition, increases individual seed weight without dramatically 
10 affecting starch content (U.S. Patent Nos. 5,650,557 and 5,872,21 6). 

Over the years, the desire for high biological yield has aroused an interest in 
manipulating plant structure in order to obtain plants where the economically usefiil part 
forms are as large a proportion of the plant as is consistent with acceptable plant vigor 
and health. Attempts to increase yield by altering the relative contribution of the different 
1 5 components of grain or kernel yield, such as ears or heads per plant, grains per head or 
kernels per ear, grain size or kernel size, e/c, have proven unsuccessful because increases 
in one component tend to be accompanied by reductions in another (Wilson, D. (1981) 
Plant Breeding H . K. Frey Edited, Iowa, Iowa State University Press, page 255). 
However, yield increases due to an increase in the proportion of grain relative to 
20 vegetative parts have been conunon in the cereal crops (Wilson, D. (1981) Plant Breeding 
IL K. Frey Edited, Iowa, Iowa State University Press, page 255). 

Langer and Hill (Langer, R. H. M. and Hill, G. D. (1991) Agricultural Plants. 
Second Edition. Cambridge, Cambridge University Press, page 341) state that higher 
yields can be achieved by improving the Harvest Index (HI), since HI links biological 
25 yield (¥^^01) and economic yield (Y,,^ J in the following manner : 

^biol X HI = Ygcon. 

It is pointed out that treatments affecting HI will also affect Y^^^ but not necessarily to the 
same extent or in the same direction. For instance, in cereals it is possible to increase 
biological yields by applying nitrogen at high population densities in the presence of 
30 adequate water. The expected result is heavy vegetative growth, but reduced light 

transmission into the canopy, poor grain set and development will lead to a low Harvest 
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Index. In contrast, short-strawed cereals are characterized by greater Harvest Index. 
Short, erect cuhivars of rice yielding 4-5 1 ha'* have been shown to have a Harvest Index 
of about 0.53 to 0.56, compared with 0,39 to 0.42 for tall, leafy cultivars with a grain 
yield of about 2.4 t ha ' (Langer, R. H. M. and Hill, G. D. (1991) Agricultural Plants. 
Second Edition, Cambridge, Cambridge University Press, page 341). Likewise, in wheat, 
the dwarf and semidwarf cultivars emanating from the Mexican plant breeding program 
have a higher Harvest Index. However, short plants may also produce little grain. Thus, 
it is necessary to assess both the biological yield and Harvest Index in plant breeding 
programs. 

The present invention provides a method of increasing the seed production and 
the biomass production of plants. More specifically, this invention provides transgenic 
plants which have increased total seed number, increased individual seed weight, 
increased total seed weight per plant, as well as increased above-ground plant biomass 
and an improved Harvest Index when compared to non-transgenic plants of the same 
genetic background. The production of plants with increases in all of these parameters 
as the result of a transgene is quite unexpected in view of normal source/sink 
relationships in plants. 

Brief Summary of the Invention 

This invention provides methods of producing plants which have improved plant 
production, both plant seed production and plant biomass production. This invention also 
provides the plants produced by the disclosed methods, wherein the plants are 
monocotyledonous plants and dicotyledonous plants. 

More specifically, this invention provides methods for increasing the number of 
seeds produced by plants, increasing the biomass produced by plants, or increasing the 
Harvest Index of plants by introducing into such plants a nucleic acid operably linked to 
a promoter, wherein the nucleic acid is that of SH2'REV6-HS (SEQ DD NO: 3), a nucleic 
acid which hybridizes with SH2-REV6-HS under high stringency conditions and encodes 
a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ ID NO: 
4), a fragment of SH2-REV6-HS encoding a peptide that retains biological activity of 
SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID N0:4, or a 
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fragment thereof that retains biological activity of SH2-REV6-HS, or a nucleic acid 
encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS polypeptide is 
the SH2HS33 polypeptide. The methods further include growing the plants produced by 
such methods. The present invention also includes the plants produced by such methods. 

The methods of the present invention are applicable to monocotyledonous plants, 
such as rice, wheat, barley, oats, sorghum, and millet, and dicotyledonous plants, such 
as peas, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, 
small bumet, soybean, and lettuce. 

This invention also provides methods of increasing the flag leaf weight of 
monocotyledonous plants by introducing into such plants a nucleic acid operably linked 
to a promoter, wherein the nucleic acid is that of SH2-REV6-HS (SEQ ID NO: 3), a 
nucleic acid which hybridizes with SH2'REV6-HS under high stringency conditions and 
encodes a polypeptide that retains biological activity of the protein SH2-REV6-HS (SEQ 
ID NO: 4), a fragment of SH2-REV6-HS encoding a peptide that retains biological 
activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising SEQ ID 
N0:4, or a fragment thereof that retains biological activity of SH2-REV6-HS, or a 
nucleic acid encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS 
polypeptide is the SH2HS33 polypeptide. The methods further include growing the 
plants produced by such methods. The present invention also includes the plants 
produced by such methods. 

The invention also provides methods of increasing the number of seed heads 
produced by monocotyledonous plants by introducing into such plants a nucleic acid 
operably linked to a promoter, wherein the nucleic acid is that of SH2'REV6'HS (SEQ 
ID NO: 3), a nucleic acid which hybridizes with SH2'REV6'HS under high stringency 
conditions and encodes a polypeptide that retains biological activity of the protein SH2- 
REV6-HS (SEQ ID NO: 4), a fragment of SH2'REV6-HS encoding a peptide that retains 
biological activity of SH2-REV6-HS, a nucleic acid encoding a polypeptide comprising 
SEQ ID N0:4, or a fragment thereof that retains biological activity of SH2-REV6-HS, 
or a nucleic acid encoding an SH2HS or an SH2RTS polypeptide. Preferably, the SH2HS 
polypeptide is the SH2HS33 polypeptide. The methods further include growing the 
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The present invention provides plants which include a nucleic acid encoding the 
amino acid sequence of SH2-REV6-HS (SEQ ID NO: 4), or a jfragment thereof that 
retains biological activity of SH2-REV6-HS. 

The present invention provides plants which include a nucleic acid encoding the 
amino acid sequence of an SH2HS or an SH2RTS protein, or a fragment thereof that 
retains biological activity of an SH2HS or an SH2RTS protein. In a preferred 
embodiment, the SH2HS polypeptide has the amino acid sequence of SH2HS33. 

Brief Description of the Drawing 
Figure 1 shows a Northern blot analysis of Sh2'Rev6-HS transgenic rice lines. 

Detailed Description of the Invention 

I. Definitions 

As used herein, the term "AGP" means ADP glucose pyrophosphorylase. 

As used herein, the term "allele" means any of several alternative forms of a gene. 

As used herein, the term "biological activity" means any functional activity of an 
SH2 mutant polypeptide of the invention, such as the SH2-REV6, SH2HS33, and SH2- 
REV6-HS polypeptides. The functional activity of the subject polypeptides includes but, 
is not limited to, increasing total seed number, increasing individual seed weight, 
increasing total seed weight per plant, increasing above-ground plant biomass, increasing 
Harvest Index, and phosphate insensitivity, and increased heat stability. 

As used herein, the term "5r2" means the Brittle-2 gene encoding the small 
subunit of AGP. As used herein, the term means a mutant form of the Bt2 gene, 
which renders the kernels of com brittle in texture upon drying. 

As used herein, the term "cereal" means, depending on the context, either: 1) a 
grass plant, such as com, or 2) the grain of a grass plant. 

As used herein, the term "crop plant" means any plant grown for any conunercial 
purpose, including, but not limited to the following purposes: seed production, grain 
production, hay production, ornamental use, fruit production, berry production, vegetable 
production, oil production, protein production, forage production, silage, animal grazing, 
golf courses, lawns, flower production, landscaping, erosion control, green manure, 
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improving soil tilth/health, producing pharmaceutical products/drugs, producing food 
additives, smoking products, pulp production and wood production. Particular crop 
plants of interest to the present invention include, but are not limited to, wheat, rice, 
maize, barley, rye, sugar beets, potatoes, sweet potatoes, soybeans, cotton, tomatoes, 
5 canola and tobacco. 

As used herein, the term "cross pollination" or "cross-breeding" means the pollen 
of one flower on one plant is applied (artificially or naturally) to the ovule (stigma) of a 
flower on another plant. 

As used herein, the term "cultivar" means a variety, strain or race of plant which 
10 has been produced by horticultural or agronomic techniques and is not normally found 

in wild populations. 

As used herein, the terms "Dicotyledoneae", "dicotyledonous", "dicotyledon" or 
"dicot" are synonymous and mean any of various flowering plants having two embryonic 
seed leaves or cotyledons that usually appear at germination. Examples include, but are 
15 not limited to, tobacco, soybeans, potato, sweet potato, radish, cabbage, rape and apple 
trees. 

As used herein, the term "flag leaf refers to the uppermost leaf on a fruiting 
(fertile) culm; the leaf immediately below the inflorescence or seed head. 

As used herein, the term "genotype" means the genetic makeup of an individual 
20 cell, cell culture, plant, or group of plants. 

As used herein, the term "grain" means, depending on its context, either: 1) the 
cereal grasses considered as a group, or 2) the fruit of one or more of the cereal grasses. 

As used herein, the terms "grass" or "grasses" mean a plant belonging to the 
family Poaceae. 

25 As used herein, the term "Harvest Index" is the proportion of total plant mass 

harvested. It is the ratio of weight of grain/ (weight of grain plus weight of plant). This 
is identical to HI as discussed elsewhere herein (see, also, Langer and Hill, 1991), 
wherein HI links biological yield and economic yield, and HI is the ratio of economic 
yield/biological yield. The economic yield (Y^^on) is the weight of grain, while the 

30 biological yield (Ybioi) is the weight of grain plus weight of plant. The weight of grain is 
synonymous with the total seed weight. 
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As used herein, the term "hetero2ygote" means a diploid or polyploid individual 
cell or plant having different alleles (forms of a given gene) at least at one locus. 

As used herein, the term "heterozygous" means the presence of different alleles 
(forms of a given gene) at a particular gene locus. 
5 As used herein, the term "homozygote" means an individual cell or plant having 

the same alleles at one or more loci. 

As used herein, the term "homozygous" means the presence of identical alleles 
at one or more loci in homologous chromosomal segments. 

As used herein, the term "hybrid" means any individual plant resulting from a 
10 cross between parents that differ in one or more genes. 

As used herein, the term "inbred" or "inbred line" means a relatively true- 
breeding strain. 

As used herein, a nucleic acid molecule is said to be "isolated" when the nucleic 
acid molecule is substantially separated from contaminant nucleic acid encoding other 
15 polypeptides from the source of nucleic acid. 

As used herein, the term "line", when directed to a type of plant, means self- or 
cross-fertilizing plants and single-hne facultative apomicts, having largely the same 
genetic background, that are similar in essential and distinctive characteristics. 

As used herein, the term "locus" (plural: "loci") means any site that has been 
20 defined genetically. A locus may be a gene, or part of a gene, or a DNA sequence that 
has some regulatory role, and may be occupied by different sequences. 

As used herein, the term "mass selection" means a form of selection in which 
individual plants are selected and the next generation propagated from the aggregate of 
their seeds. 

25 As used herein, the terms "Monocotyledoneae", "monocotyledonous", 

"monocotyledon" or "monocot" are synonymous and mean any of various flowering 
plants having a single cotyledon in the seed. Examples of monocots include, but are not 
limited to, rice, wheat, barley, maize and lilies. 

As used herein, the term "Northern Blot" refers to the analysis of RNA by 

30 electrophoresis of RNA on agarose gels to fractionate the RNA according to size 

followed by transfer of the RNA from the gel to a solid support, such as nitrocellulose 
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or a nylon membrane. The immobilized RNA is then probed with a labeled probe to 
detect RNA species complementary to the probe used. Northem blots are a standard tool 
of molecular biologists (Sambrook et aL, Molecular C loning: A Laboratoiy Manual. 2nd 
edition, Cold Spring Harbor Laboratory Press, 1985). 

As used herein, the term "open pollination" means a plant population that is freely 
exposed to some gene flow, as opposed to a closed one in which there is an effective 
barrier to gene flow. 

As used herein, the terms "open-pollinated population" or "open-pollinated 
variety" mean plants normally capable of at least some cross-fertilization, selected to a 
standard, that may show variation but that also have one or more genotypic or phenotypic 
characteristics by which the population or the variety can be differentiated from others. 
A hybrid which has no barriers to cross-pollination is an open-pollinated population or 
an open-pollinated variety. 

As used herein, the term "ovule" means the female gametophyte, whereas the 
term "pollen" means the male gametophyte. 

As used herein, the term "phenotype" means the observable characters of an 
individual cell, cell culture, plant, or group of plants which results from the interaction 
between that individual's genetic makeup {i.e., genotype) and the environment. 

As used herein, the term "progeny" means the descendants of a particular plant 
(self-cross) or pair of plants (crossed or backcrossed). The descendants can be of the Fj, 
the Fj, or any subsequent generation. Typically, the parents are the pollen donor and the 
ovule donor which are crossed to make the progeny plant of this invention. Parents also 
refer to F, parents of a hybrid plants of this invention (the F, plants). Finally, parents 
refer to a recurrent parent which is backcrossed to hybrid plants of this invention to 
produce another hybrid plant of this invention. 

As used herein, the term "Polymerase Chain Reaction" is synonymous with 
"PCR" and refers to techniques in which cycles of denaturation, annealing with 
oligonucleotide primers, and extension with DNA polymerase, are used to amplify the 
number of copies of a target DNA sequence. 

As used herein, the term "revertant" refers to a mutated Sh2 gene {i.e., mutated 
relative to the wild-type Sh2 gene) wherein the mutant results in a wild-type kernel 
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phenotype (/.e, a plump seed, not a shrunken seed like the phenotype displayed by the 
mutant sh2sh2 genotype). A revertant genotype would have more AGP activity than a 
sh2sh2 genotype and may have either more or less AGP activity than a wild-type Sh2 
genotype. Typically, the revertants have a wild-type seed phenotype with at least around 
30% AGP activity compared to that of a nomial non-revertant), wild-type, hi some 
instances, the term "revertant" may refer to the cell or plant which contains the mutated 
Sh2 gene. 

As used herein, the term "rice" means any Oryza species, including, but not 
limited to, O, saliva, O. glaberrima, O. perennis, O. nivara, and O. breviligulata. Thus, 
as used herein, the term "rice" means any type of rice including, but is not limited to, any 
cultivated rice, any wild rice, any rice species, any intra- and inter-species rice crosses, 
all rice varieties, all rice genotypes and all rice cultivars. 

As used herein, the term "self pollinated" or "self-pollination" means the pollen 
of one flower on one plant is applied (artificially or naturally) to the ovule (stigma) of the 
same or a different flower on the same plant. 

As used herein, the term "5A2" refers to the Shrunken-2 gene encoding the large 
subunit of AGP. Sometimes, the term may refer to the cell or plant which contains the 
Sh2 genotype. 

As used herein the term "5/i2" means a mutant form of the Sh2 gene, which 
renders the kernels of com shrunken or collapsed upon drying. Sometimes, the term may 
refer to the cell or plant which contains the sh2 genotype. 

As used herein, the term ''Sh2hs'' refers to mutants of the Shrunken-2 gene which 
encode heat-stable variants of maize endosperai AGP. Sometimes, the term may refer 
to the cell or plant which contains the Sh2hs genotype. The term "SH2HS" refers to 
polypeptides encoded by Sh2hs. A preferred embodiment contemplated by the subject 
invention is the Sh2hs33 gene which encodes the polypeptide referred to herein as 
SH2HS33. The SH2HS33 polypeptide contains the HS33 mutation disclosed in U.S. 
Patent No. 6,069,300 and published PCT application WO 99/58698. Other embodiments 
contemplated for use in the methods of the present invention include, but are not limited 
to, Sh2hsJ3, Sh2hsl4, Sh2hsl6, Sh2hs39, Sh2hs40, and Sh2hs47 polynucleotides which 
encode the polypeptides referred to herein as SH2HS13, SH2HS14, SH2HS16, 
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SH2HS39, SH2HS40, and SH2HS47, respectively. The SH2HS13, SH2HS14, 
SH2HS16, SH2HS39, SH2HS40, and SH2HS47 polypeptides contain the HS13, HS14, 
HS16, HS39, HS40, and HS47 mutations, respectively, that are disclosed in U.S. Patent 
No. 6,069,300 and published PCT application WO 99/58698. 

As used herein, the term '"Shirts'' refers to temperature sensitive revertant mutants 
of the Shrunken-2 gene which encode heat-stable variants of maize endosperm AGP. 
Sometimes, the term may refer to the cell or plant which contains the Shirts genotype. 
The term "SH2RTS" refers to polypeptides encoded by Shirts. Examples of 
embodiments contemplated for use in the methods of the present invention include, but 
are not limited to, Shlrts48'l, and Shlrts60-1 polynucleotides which encode the 
polypeptides referred to herein as SH2RTS48-2 and SH2RTS60-1, respectively. The 
SH2RTS48-2 and the SH2RTS60-2 polypeptides contain the RTS48-2 and RTS60-2 
mutations, respectively, disclosed in U.S. Patent No. 6,069,300 and published PCT 
application WO 99/58698. 

As used herein, the term "'ShlhsSS" refers to a single point mutation in Shi which 
increases the stability of maize endosperm AGP through enhanced subunit interactions. 
The mutation is a change from His-to-Tyr at amino acid position 333 (Greene and 
Hannah, 1998). Sometimes, the term may refer to the cell or plant which contains the 
Shlhs33 genotype. 

As used herein, the term ''Shl'Rev6" is synonymous with '"Shl-ml'Revff' and 
refers to variants of the Shrunken-2 gene. The polypeptide product of the Shl-Rev6 gene 
contains two additional amino acids, tyrosine and serine, inserted between amino acids 
494 and 495 of the wild-type Shi polypeptide. Maize endosperm encoded by Shl-Rev6 
expresses an AGP that is insensitive to phosphate and results in an increased seed weight 
in maize (Giroux et aL, 1996; U.S. Patent Nos. 5,650,557 and 5,872,216). Sometimes, 
the term may refer to the cell or plant which contains the Shl-Rev6 genotype. 

As used herein, the term ""Shl-Revd-HS" is synonymous with ''Shl-mlRevd-HS" 
and refers to a heat stable variant of the Shl-Rev6 gene, wherein His is replaced by Tyr 
at position 333. Sometimes, the term may refer to the cell or plant which contains the 
Sh2-Rev6-HS genotype. The HS33 mutation of maize AGP, along with other mutations 
conferring heat stability, are disclosed in U.S. Patent No. 6,069,300 and published PCT 
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application WO 99/58698 and are specifically contemplated for use in the methods of the 
present invention. 

As used herein, the term ''Sh2hs33'' refers to a specific heat stable genetic variant 
of Sh2. The variant contains a His to Tyr mutation at position 333 of the wild-type maize 
5 Sh2 gene (Greene and Hannah, 1998). The mutation renders the maize endosperm AGP 
activity heat-stable. Sometimes, the term may refer to the cell or plant which contains 
the Sh2hs33 genotype. 

As used herein, the phrase "shrunken and brittle" describes the morphology of 
specific types of kernels of a com. In a brittle and shrunken kernel, the endosperm is 
10 greatly collapsed. The endosperm before drying is like a fluid-filled sac that develops 

with little starch. On drying, the kernel shrinks and collapses into an angular structure 
with marked concavities and brittle texture (Coe et al., 1988). 

As used herein, the term "synthetic" means a set of progenies derived by 
intercrossing a specific set of clones or seed-propagated lines. A synthetic may contain 
15 mixtures of seed resulting from cross-, self-, and sib- fertilization. 

As used herein, the terms "Ti, Tj, T3, . . refer to the succeeding generations of 
cells or plants tracing back to a particular tissue culture-derived or transformed cell line 
designated as Tq, or the parental generation. As regards plants, the plants produced 
directly from the transformed cells are referred to as the Tq generation. The seeds 
20 produced by selfing the Tq generation plants are referred to as the T, seeds. When the Tj 
seeds are germinated, the resulting plants are referred to as the T, generation or the T, 
progeny. Seeds produced by the T, generation are referred to as the Tj seeds. 

As used herein, in grasses, the terni "tiller" means a lateral shoot arising at ground 
level. Each of the tillers that were counted in the present studies had a head on the stem 
25 of the shoot. 

As used herein, the term "transformation" means the transfer of nucleic acid 
a nucleotide polymer) into a cell. As used herein, the term "genetic transformation" 
means the transfer and incorporation of DNA, especially recombinant DNA, into a cell. 
As used herein, the term "transgenic" means cells, cell cultures, plants, and 
30 progeny of plants which have received a foreign or modified nucleic acid sequence by 

one of the various methods of transformation, wherein the foreign or modified nucleic 
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add sequence is from the same or different species than the species of the plant receiving 
the foreign or modified nucleic acid sequence. The foreign or modified nucleic acid used 
to produce such transgenic cells, cell cultures, plants and progeny of such plants includes 
genes, gene fragments as well as nucleic acid sequences which code for a product which 

5 has at least one biological activity or function. As used herein, the terms "transgenic 

plant" and "transformed plant" are synonymous, as are the terms "transgenic line" and 
"transformed line". As used herein, the phrases "corresponding non-transgenic plant" 
and "corresponding non-transgenic line" refer to the cells, cell cultures, plants and 
progeny of plants which did not receive the foreign or modified gene which the 

10 "transgenic" cells, cell cultures, plants and progeny of plants which did receive the 

foreign or modified gene. 

As used herein, the term "variety" means a subdivision of a species, consisting 
of a group of individuals within the species which are distinct in form or function from 
other similar arrays of individuals. 

1 5 As used herein, the term "wheat" means any Triticum species, including, but not 

limited to, T. aestivum, T. monococcum, J. tanschii and T. turgidum. Thus, as used 
herein, the term "wheat" means any type of wheat including, but is not limited to, any 
cultivated wheat, any wild wheat, any wheat species, any intra- and inter-species wheat 
crosses, all wheat varieties, all wheat genotypes and all wheat cultivars. Cultivated 

20 wheats include, but are not limited to, einkom, durum and conmion wheats. 

As used herein, the term "wild-type" refers to the naturally occurring allele of a 
particular gene. Sometimes the terms refers to the cell or plant containing the wild-type 
alleles of the particular gene. 

25 II. Nucleic Acids Encoding Sh2^Rev6 and Sh2-Rev6'HS 

Giroux et al, (1996) isolated and sequenced genomic DNA and cDNA encoding 
Sh2'Rev6. The nucleotide sequence of Sh2'Rev6 is provided in SEQ ID NO: 1 and the 
amino acid sequence of SH2-REV6 is provided in SEQ ID NO: 2 (see, also, U.S. Patent 
No. 5,650,557 and U.S. Patent No. 5,872,216). Com seeds that contain at least one 

30 functional Sh2-Rev6 allele have been deposited with the American Type Culture 

Collection (ATCC), 12301 Parklawn Drive, Rockville, MD, 20852 USA, on May 16, 
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1999 and assigned accession number ATCC 97624 (see, column 5 of U.S. Patent Nos. 
5,650,557 and 5, 872,2 16). 

Sh2'Rev6 was further modified by altering His to Tyr at amino acid position 333 
to produce the variant Sh2-Rev6'HS (Greene and Hannah et al, 1998; U.S. Patent No. 
6,069,300). The nucleotide sequence of Sh2-Rev6-HS is provided in SEQ ID NO: 3 and 
the amino acid sequence of SH2-REV6-HS is provided in SEQ ID NO: 4. 

As used herein, Sh2-Rev6, Sh2hs33, and Sh2'Rev6-HS include the specifically 
identified and characterized variants herein described as well as allelic variants, 
conservative substitution variants and homologues that can be isolated/generated and 
characterized without undue experimentation following methods well known to one 
skilled in the art. 

Homology or identity at the amino acid -or nucleotide level is determined by 
BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed by 
the programs blastp, blastn, blastx, tblastn and tbiastx (Karlin et al., 1990, Proc. Natl. 
Acad. Sci. USA 87, 2264-2268 and Altschul, 1993, J. Mol. EvoL 36. 290-300, fully 
incorporated by reference) which are tailored for sequence similarity searching. The 
approach used by the BLAST program is to first consider similar segments between a 
query sequence and a database sequence, then to evaluate the statistical significance of 
all matches that are identified and finally to summarize only those matches which satisfy 
a preselected threshold of significance. For a discussion of basic issues in similarity 
searching of sequence databases (see Altschul et aL, 1994, Nature Genetics 6, 1 19-129 
which is fully incorporated by reference). The search parameters for histogram, 
descriptions, alignments, expect (/.e., the statistical significance threshold for reporting 
matches against database sequences), cutoff, matrix and filter are at the default settings. 
The default scoring matrix used by blastp, blastx, tblastn, and tbiastx is the 
BLOSUM62 matrix (Henikofifef a/., 1 992. Proc. Natl. Acad. Sci. USA 89. 10915-10919, 
fiiUy incorporated by reference). For blastn, the scoring matrix is set by the ratios of M 
(i.e., the reward score for a pair of matching residues) to N (z.e., the penalty score for 
mismatching residues), wherein the default values for M and N are 5 and -4, respectively. 

The terms ''Sh2-Rev6 genes," ''Sh2'Rev6'HS genes," and "Sh2hs33 genes" include 
all allelic variants of the Sh2'Rev6 genes, Sh2hs33 genes, and Sh2-Rev6-HS genes 
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exemplified herein, wherein such allehc variants code for proteins which result in- one or 
more of the same physiological characteristics as those of the proteins produced by the 
Sh2'Rev6, Sh2hs33, and Sh2-Rev6'HS genes disclosed herein. 

The Sh2'Rev6, Sh2hs33, and Sh2'Rev6-HS nucleic acid molecules or fragments 
thereof utilized in the present invention may also be synthesized using methods known 
in the art. It is also possible to produce the molecule by genetic engineering techniques, 
by constructing DNA using any accepted technique, cloning the DNA in an expression 
vehicle and transfecting the vehicle into a cell which will express the SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins. See, for example, the methods set forth in 
Sambrook et ai. Molecular Cloning: A Laboratory Manual . 2nd edition. Cold Spring 
Harbor Laboratory Press, 1985. 

It is understood that all polynucleotides encoding all or a portion of the 
polypeptides of the present invention, such as the SH2-REV6, SH2HS33, and SH2- 
REV6-HS proteins, are also included herein, as long as they encode a polypeptide with 
one or more of the functional activities of the subject proteins as set forth herein. Thus, 
for example, any polynucleotide fragment having the activities of the SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins discussed herein are encompassed by the present 
invention. 

Polynucleotide sequences of the invention include DNA, cDNA, synthetic DNA 
and RNA sequences which encode polypeptides of the present invention, such as, for 
example, SH2-REV6, SH2HS33, and SH2-REV6-HS proteins. Such polynucleotides 
also include naturally occurring, synthetic and intentionally manipulated polynucleotides. 
For example, such polynucleotide sequences may include genomic DNA which may or 
may not include naturally occurring introns. Moreover, such genomic DNA may be 
obtained in association with promoter regions or poly A sequences. As another example, 
portions of the mRN A sequence may be altered due to ahemate RNA splicing patterns 
or the use of altemate promoters for RNA transcription. As yet another example, 
Rev6, Sh2hs33, and Sh2-Rev6-HS polynucleotides may be subjected to additional 
mutations using, for example, site-directed mutagenesis and DNA shuffling. 

The polynucleotides of the invention further include sequences that are degenerate 
as a result of the genetic code. The genetic code is said to be degenerate because more 
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than one nucleotide triplet can code for the same amino acid. There are 20 natural amino 
acids, most of which are specified by more than one codon. It will be appreciated by 
those skilled in the art that as a result of the degeneracy of the genetic code, a multitude 
of nucleotide sequences, some bearing minimal nucleotide sequence homology to the 
nucleotide sequences of the subject polynucleotides, such as Sh2-Rev6, Sh2hs33, and 
Sh2'Rev6-HS, may be utilized in the present invention. Therefore, all degenerate 
nucleotide sequences are included in the invention as long as the amino acid sequence of 
the subject polypeptides, for example, the SH2-REV6, SH2HS33, and SH2-REV6-HS 
polypeptides, encoded by the nucleotide sequence are functionally unchanged or 
substantially similar in function. The invention specifically contemplates each and every 
possible variation of peptide or nucleotide sequence that could be made by selecting 
combinations based on the possible amino acid and codon choices made in accordance 
with the standard triplet genetic code as applied to polynucleotide sequences of the 
invention, as exemplified by Sh2'Rev6, Sh2hs33, and Sh2'Rev6-HS, and all such 
variations are to be considered specifically disclosed herein. 

Also included in the invention are fragments (portions, segments) of the 
sequences disclosed herein which selectively hybridize to polynucleotides of the present 
invention, such as, for example, Sh2'Rev6, Sh2hs33, md Sh2'Rev6'HS, Selective 
hybridization as used herein refers to hybridization under stringent conditions (See, for 
example, the techniques in Maniatis et al (1989) Molecular Cloning: A Laboratorv 
Manual . Cold Spring Harbor Laboratory Press), which distinguishes related from 
unrelated nucleotide sequences. The active fragments of the invention, which are 
complementary to mRNA and the coding strand of DNA, are usually at least about 15 
nucleotides, more usually at least 20 nucleotides, preferably 30 nucleotides and more 
preferably may be 50 nucleotides or more. 

"Stringent conditions" are those that (I) employ low ionic strength and high 
temperature for washing, for example, 0.5 M sodium phosphate buffer pH 7.2, 1 mM 
EDTA pH 8.0 in 7% SDS at either 65°C or 55°C, or (2) employ during hybridization a 
denamring agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% 
bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 0.05 M sodium 
phosphate buffer at pH 6.5 with 0.75 M NaCl, 0.075 M sodium citrate at 42°C. A 
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specific example includes the use of 50% foimamide, 5x SSC (0.75 M NaCl, 0.075 M 
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5x 
Denhardt's solution, sonicated salmon sperm DNA (50 ^ig/ml), 0.1% SDS, and 10% 
dextran sulfate at 55'*C, with washes at 55°C in 0.2x SSC and 0.1% SDS. A skilled 
artisan can readily determine and vary the stringency conditions appropriately to obtain 
a clear and detectable hybridization signal. Preferred molecules are those that hybridize 
under the above conditions to the complements of Sh2'Rev6, Sh2hs33, and Sh2'Rev6-HS 
and which encode a functional protein. 

The present invention utilizes nucleic acid molecules encoding the subject SH2 
mutant proteins, such as SH2-REV6, SH2HS33, and SH2-REV6-HS, which hybridize 
with nucleic acid molecules comprising sequences complementary to the subject 
polynucleotides encoding SH2-REV6, SH2HS33, and SH2-REV6-HS under conditions 
of sufficient stringency to produce a clear signal. As used herein, "nucleic acid" is 
defined as RNA or DNA encoding polypeptides of the invention, such as, for example, 
SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides, or RNA or DNA sequences 
which are complementary to nucleic acids encoding such peptides, or RNA or DNA 
sequences which hybridize to such nucleic acids and remain stably bound to them under 
stringent conditions, or RNA or DNA sequences which encode polypeptides sharing at 
least 60% sequence identity, or at least 65% sequence identity, or at least 70% sequence 
identity, or at least 75% sequence identity, or at least 80% sequence identity, or at least 
. 85% sequence identity, preferably at least 90% sequence identity, and more preferably 
at least 95% sequence identity with proteins of the present invention, such as SH2-REV6, 
SH2HS33, and SH2-REV6-HS. 

The present invention further provides fragments of any one of the encoding 
nucleic acids molecules. As used herein, a fragment of an encoding nucleic acid 
molecule refers to a small portion of the entire protein coding sequence. The size of the 
fragment will be determined by the intended use. For example, if the fragment is chosen 
so as to encode an active portion of the protein, the fragment will need to be large enough 
to encode the functional region(s) of the protein. For instance, fragments of the invention 
encode the domains or regions of the SH2-REV6, SH2HS33, and SH2-REV6-HS of the 
present invention which are involved with the allosteric regulation of AGP. If the 
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fragment is to be used as a nucleic acid probe or PCR primer, then the fragment length 
is chosen so as to obtain a relatively small number of false positives during probing and 
priming. 

Fragments of the encoding nucleic acid molecules of the present invention (z,e., 

5 synthetic oligonucleotides) that are used as probes or specific primers for the polymerase 
chain reaction (PCR), or to synthesize gene sequences encoding proteins of the invention 
can easily be synthesized by chemical techniques, for example, the phosphotriester 
method of Matteucci et al. (1981) J. Am. Chem. Soc. 103, 3185-3191) or using 
automated synthesis methods. In addition, larger DNA segments can readily be prepared 

10 by well known methods, such as synthesis of a group of oligonucleotides that define 
various modular segments of the gene, followed by ligation of oligonucleotides to build 
the complete modified gene. 

The encoding nucleic acid molecules of the present invention may fiirther be 
modified so as to contain a detectable label for diagnostic and probe purposes. A variety 

15 of such labels are known in the art and can readily be employed with the encoding 
molecules herein described. Suitable labels include, but are not limited to, biotin, 
radiolabeled nucleotides and the like. A skilled artisan can employ any of the art known 
labels to obtain a labeled encoding nucleic acid molecule. 

Modifications to the primary structure itself by deletion, addition, or alteration of the 

20 amino acids incorporated into the protein sequence during translation can be made without 
destroying the activity of the protein. Such substitutions or other alterations result in proteins 
having an amino acid sequence encoded by a nucleic acid falling within the contemplated 
scope of the present invention. 

25 m. Isolation of Oth er Related Nucleic Acid Molecules 

As described herein, the identification and characterization of the nucleic acid 
molecules of the present invention, such as those encoding an SH2-REV6, SH2HS33, or 
SH2-REV6-HS protein, or a fragment of an SH2-REV6, SH2HS33, or SH2-REV6-HS 
protein, allows a skilled artisan to isolate nucleic acid molecules that encode other members 

30 of the protein family in addition to the sequences herein described. Further, the presently 
disclosed nucleic acid molecules allow a skilled artisan to isolate nucleic acid molecules that 
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encode other members of the family of proteins in addition to the SH2-REV6, SH2HS33, and 
SH2-REV6-HS disclosed herein. 

Essentially, a skilled artisan can readily use any one of the amino acid sequences 
disclosed herein to generate antibody probes to screen expression libraries prepared from 
appropriate cells. Typically, polyclonal antiserum from mammals such as rabbits inmiunized 
with the purified protein or monoclonal antibodies can be used to probe a cDN A or genomic 
expression library to obtain the appropriate coding sequence for other members of the protein 
femily. The cloned cDNA sequence can be expressed as a fiasion protein, expressed directly 
using its own control sequences, or expressed by constructions using control sequences 
appropriate to the particular host used for expression of the enzyme. 

Alternatively, a portion of the coding sequence herein described can be synthesized 
and iised as a probe to retrieve DNA encoding a member of the protein family from any 
organism. Oligomers containing approximately 18-20 nucleotides (encoding about a six to 
seven amino acid stretch) are prepared and used to screen genomic DNA or cDN A libraries 
to obtain hybridization under stringent conditions or conditions of sufl5cient stringency to 
eliminate an undue level of false positives. 

Additionally, pairs of oligonucleotide primers an be prepared for use in a polymerase 
chain reaction (PGR) to selectively clone an encoding nucleic acid molecule. A PGR 
denature/anneal/extend cycle for using such PGR primers is well known in the art and can 
readily be adapted for use in isolating other encoding nucleic acid molecules. 

IV. Production of R ecombinant Proteins Using a Recombinant DNA (rPNA^ Molecule 

The present invention fiirther provides methods for producing polypeptides of the 
invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS using the. nucleic acid 
molecules herein described. In general terms, the production of a recombinant form of a 
protein typically involves the following steps: First, a nucleic acid molecule is obtained that 
encodes, for example, an SH2-REV6, SH2HS33, and SH2-REV6-HS protein, or a fragment 
of an SH2-REV6, SH2HS33, and SH2-REV6-HS protein. If the encoding sequence is 
xminterrupted by introns, it is directiy suitable for expression in any host. The nucleic acid 
molecule is then preferably placed in operable linkage with suitable control sequences, as 
described above, to form an expression unit containing the protein open reading frame. The 
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expression unit is used to transform a suitable host and the transformed host is cultured under 
conditions that allow the production of the recombinant protein. Optionally the recombinant 
protein is isolated from the medium or from the cells; recoveiy and purification of the protein 
may not be necessary in some instances where some impurities may be tolerated. 

5 Each of the foregoing steps can be done in a variety of ways. For example, the 

desired coding sequences may be obtained from genomic fragments and used directly in 
appropriate hosts. The construction of expression vectors that are operable in a variety of 
hosts is accomplished using appropriate replicons and control sequences, as set forth above. 
The control sequences, expression vectors, and transformation methods are dependent on the 

10 type of host cell used to express the gene and were discussed in detail earlier. Suitable 
restriction sites can, if not normally available, be added to the ends of the coding sequence 
so as to provide an excisable gene to insert into these vectors. A skilled artisan can readily 
adapt any host-expression system known in the art for use with the nucleic acid molecules of 
the invention to produce recombinant protein. 

15 

V. SH2-REV6. SH2HS33. and SH2-REV6-HS Proteins 

As used herein, an SH2-REV6, SH2HS33, and SH2-REV6-HS protein refers to a 
protein that has the amino acid sequence encoded by the polynucleotide of SH2-REV6, 
. SH2HS33, and SH2-REV6-HS, allelic variants thereof and conservative substitutions thereof 

20 that have SH2-REy6, SH2HS33, and SH2-REV6-HS activity. In addition, the polypeptides 
utilized in the present invention include the proteins encoded by SH2'REV6, SH2HS33, and 
SH2'REV6'HS, as well as polypeptides and fragments, particularly those which have the 
biological activity of SH2-REV6, SH2HS33, and SH2-REV6-HS and also those which have 
at least 65% sequence identity to the polypeptides encoded by SH2-REV6, SH2HS33, and 

25 SH2-REV6-HS or the relevant portion, or at least 70% identity, or at least 75% identity, or 
at least 80% identity, or at least 85% identity to the polypeptides encoded by SH2-REV6, 
SH2HS33, and SH2-REV6-HS or the relevant portion, and more preferably at least 90% 
sequence identity to the polypeptides encoded by SH2-REV6, SH2HS33, and SH2-REV6-HS 
or the relevant portion, and still more preferably at least 95% sequence identity to the 

30 polypeptides encoded by SH2-REV6, SH2HS33, and SH2-REV6-HS or the relevant portion, 
and also include portions of such polypeptides. One of skill will recognize whether an amino 
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acid sequence of interest is within a functional domain of a protein, such as the domain or 
region of the SH2-REV6, SH2HS33, and SH2-REV6-HS involved in the allosteric regulation 
of AGP. Thus, it may be possible for a homologous protein to have less than 40% homology 
over the length of the amino acid sequence but greater than 90% homology in one functional 
5 domain. 

The SH2-REV6, SH2HS33, and SH2-REV6-HS proteins utilized in the present 
invention include the specifically identified and characterized variants herein described as 
well as allelic variants, conservative substitution variants and homologues that can be 
isolated/generated and characterized without undue experimentation following the methods 

1 0 well known to one skilled in the art. 

The term "substantially pure" as used herein refers to polypeptides of the present 
invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides, which are 
substantially free of other proteins, lipids, carbohydrates or other materials with which they 
are naturally associated. One skilled in the art can purify the subject polypeptides using 

1 5 standard techniques for protein purification. 

The invention also utilizes amino acid sequences coding for isolated polypeptides of 
the invention, such as the SH2-REV6, SH2HS33, and SH2-REV6-HS polypeptides. The 
polypeptides of the invention include those which differ from the exemplified SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins as a result of conservative variations. The terms 

20 "conservative variation" or "conservative substitution" as used herein denotes the replacement 

of an amino acid residue by another, biologically similar residue. Conservative variations or 
substitutions are not likely to change the shape of the polypeptide chain. Examples of 
conservative variations, or substitutions, include the replacement of one hydrophobic residue 
such as isoleucine, valine, leucine or methionine for another, or the substitution of one polar 

25 residue for another, such as the substitution of arginine for lysine, glutamic -for aspartic acid, 
or glutamine for asparagine, and the like. Therefore, all conservative substitutions are 
included in the invention as long as the subject polypeptides encoded by the nucleotide 
sequence are functionally unchanged or similar. 

As used herein, an isolated polypeptide of the present invention, such as an SH2- 

30 REV6, SH2HS33, and SH2-REV6-HS protein, can be a fulHengfli or any homologue of such 
proteins, such as, for example, SH2-REV6, SH2HS33, and SH2-REV6-HS proteins in which 
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amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide), 
inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, 
acetylation, myristoylation, prenylation, paknitoylation, amidation and/or addition of 
glycosylphosphatidyl inositol). Such modified proteins include those that retain at least one 
of the functional activities of the subject proteins or produce at least one of the physiological 
characteristics produced as a result of the expression of the subj ect proteins. A homologue 
of the subject proteins is a protein having an amino acid sequence that is sufficiently similar 
to the subject proteins, such as the SH2-REV6, SH2HS33, and SH2-REV6-HS protein amino 
acid sequences, that a nucleic acid sequence encoding the homologue is capable of 
hybridizing under stringent conditions to with) a nucleic acid sequence encoding the 
subject proteins (e.g., SH2-REV6, SH2HS33, and SH2-REV6-HS protein amino acid 
sequences). Appropriate stringency requirements are discussed above. 

The subject protein homologues, including SH2-REV6, SH2HS33, and SH2-REV6- 
HS protein homologues, can be the result of allelic variation of a gene encoding the protein. 
For example, SH2-REV6, SH2HS33, and SH2-REV6-HS protein homologues can be 
produced using techniques known in the art including, but not limited to, direct modifications 
to a gene encoding a protein using, for example, classic or recombinant DNA techniques to 
effect random or targeted mutagenesis. 

Minor modifications of the primary amino acid sequence of a protein of the present 
invention may result in proteins which have substantially equivalent activity as compared to 
the subject proteins (e.g., SH2-REV6, SH2HS33, and SH2-REV6-HS) produced by the genes 
described herein. As used herein, a "fiinctional equivalent" of a subject protein is a protein 
which possesses a biological activity or immimological characteristic substantially similar to 
a biological activity or immunological characteristic of the subject protein. The term 
"fiinctional equivalent" is intended to include the fiagments, variants, analogues, homologues, 
or chemical derivatives of a molecule which possess the biological activity of proteins, such 
as, SHI-REVe, SH2HS33, and SH2-REV6-HS, encoded by the genes of the present 
invention. 

The terms "SH2-REV6, SH2HS33, and SH2-REV6-HS proteins," "SH2-REV6 
proteins," "SH2HS33 proteins," and "SH2-REV6-HS proteins" include all allelic variants of 
these proteins that possess normarSH2-REV6,SH2HS33, and SH2-REV6-HS activity. In 



wo 01/64928 



PCTAJSOl/06622 



25 

general, allelic variants of SH2-REV6, SH2HS33, and SH2-REV6-HS proteins will have 
slightly different amino acid sequence than that specifically encoded by the genes utilized m 
the present invention but will be able to produce the exemplified phenotypes. Allelic 
variants, though possessing a slightly different amino acid sequence than those recited 
individual a above, will posses the ability to produce a phenotype which exhibits increased 
individual and total seed weight, increased seed number, mcreased Harvest Index (HI) and 
increased above-ground plant mass. 

The methods of the present invention can be used by one skilled in the art to produce 
plants with increased individual and total seed weight, increased seed number, increased 
Harvest Index and increased total plant mass. 

Applicants further teach methods of recognizing variations in the DNA sequences of 
. polynucleotides, such as Sh2-Rev6, Sh2hs33, and Sh2-Rev6-HS, of the present invention. One 
method involves the introduction of a nucleic acid molecule (also known as a probe) having 
a sequence complementary to, for example, an Sh2-Rev6, Sh2hs33, or Sh2'Rev6-HS gene, 
utilized in the invention under sufficient hybridizing conditions, as would be understood by 
those in the art. Another method of recognizing DNA sequence variation associated with 
polynucleotides of the present invention, including Sh2'Rev6, Sh2hs33, and Sh2-Rev6-HS, 
is direct DNA sequence analysis by multiple methods well known in the art. Another 
embodiment involves the detection of DNA sequence variation in the subject polynucleotides 
as represented by different plant genera, species, strains, varieties or cultivars. Polynucleotide 
sequences of the invention, for example, Sh2'Rev6, Sh2hs33, and Sh2-Rev6-HS, can be used 
as probes to detect the presence of corresponding genes in other plants. As discussed 
previously, Sh2-Rev6, Sh2hs33, and Sh2'Rev6'HS sequences have been determined and are 
readily available to one of ordinary skill in the art. In one embodiment, the sequences will 
bind specifically to one allele of an Sh2'Rev6, Sh2hs33, or Sh2'Rev6-HS gene, or a fiBgment 
thereof, and in another embodiment will bind to multiple alleles. Such detection methods 
include the polymerase chain reaction, restriction fiBgment length polymorphism (RFLP) 
analysis and single stranded conformational analysis. 

Diagnostic probes useful in such assays of the invention include antibodies to 
polypeptides of the present invention, such as SH2-REV6, SH2HS33, and SH2-REV6-HS. 
The antibodies may be either monoclonal or polyclonal, produced using standard techniques 
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well known in the art (See Harlow & Lane's Antibodies: A Laboratory Manual . Cold Spring 
Harbor Laboratory Press, 1988). Antibodies can be xised to detect a protein of the invention 
by binding to the protein and subsequent detection of the antibody-protein complex by 
ELISA, Western blot, or the like. Antibodies are also produced from peptide sequences of 
the subject proteins, such as SH2-REV6, SH2HS33, and SH2-REV6-HS, using standard 
techniques in the art (See Protocols in Immunology . John Wiley & Sons, 1 994). Fragments 
of the monoclonals or the polyclonal antisera which contain the immunologically significant 
portion can also be prepared. 

Assays to detect or measure the subject polypeptides, for example, SH2-REV6, 
SH2HS33, and SH2-REV6-HS polypeptides, in a biological sample with an antibody probe 
may be based on any available format. For instance, in immunoassays where SH2-REV6, 
SH2HS33, or SH2-REV6-HS polypeptides are the analyte, the test sample, typically a 
biological sample, is incubated with anti-SH2-REV6, anti-SH2HS33, or anti-SH2-REV6-HS 
antibodies under conditions that allow the formation of antigen-antibody complexes. Various 
fomiats can be employed, such as "sandwich" assay where antibody bound to a solid support 
is incubated with the test sample; washed, incubated with a second, labeled antibody to the 
analyte; and the support is washed again. Analyte is detected by determining if the second 
antibody is bound to the support. In a competitive format, which can be either heterogeneous 
or homogeneous, a test sample is usually incubated with an antibody and a labeled competing 
antigen, either sequentially or simultaneously. These and other formats are well known in the 
art. 

VI. Transformation Methods 

Methods of producing transgenic plants are well known to those of ordinary skill in 
the art. Transgenic plants can now be produced by a variety of different transformation 
methods including, but not limited to, electroporation; microinjection; microprojectile 
bombardment, also known as particle acceleration or biolistic bombardment; viral-mediated 
transformation; and Agrobacterium-mediated transformation (see, e.g., U.S. Patent Nos. 
5,405,765, 5,472,869, 5,538,877, 5,538,880, 5,550,318, 5,641,664, 5,736,369 and 5,736369; 
Watson et ai (1992) Recombinant DN A . Scientific American Books; Hinchee et ai (1988) 
Bio/Tech . 6:915-922 (1988); McCabe a/., Bio/IedL 6:92 Toriyamaefa/. (1988) 
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Bio/Tech . 6:1072-1074; Froimn et ai (1990) Eiflfljseh. 8:833-839; MuUins et ai (1990) 
BioyTech. 8:833-839; and, Raineri et al (1990) Bio/Tech . 8:33-38). 

A. Agrobacteriu m-Mediated Transformation 

Agrobacterium-mediated transformation is the most widely utilized method for 
introducing an expression vector into plants (Horsch et aL (1985) Science 227:1229). A. 
tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform 
plant cells. The Ti and Ri plasmids of A, tumefaciens and A. rhizogenes, respectively, carry 
genes responsible for genetic transformation of the plant. (Kado,C.l. f 199n Crit. Rev. Plant. 
Sci . 10:1). Descriptions of Agrobacterium vector systems and methods for 
Agrobacterium-mediated gene transfer are provided by Gruber et aL (1993) "Vectors for 
Plant Transformation" in Methods in Plant Molecular Biology and Biotechnology, Glick, 

B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton), pages 89-1 19), Miki et al 
(1993) "Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular 
Biology and Biotechnology . Click, B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca 
Raton) pages 67-88, and Moloney et al (1989) Plant Cell Reports 8:238. 

Agrobacterium-mediated transformation methods have been used principally to 
transform dicotyledonous plants. Agrobacterium-mediated transformation in dicotyledons 
facilitates the delivery of larger pieces of heterologous nucleic acid as compared with other 
transformation methods such as particle bombardment, electroporation, and polyethylene 
glycol-mediated transformation method. In addition, Agrobacterium-mediated transformation 
appears to result in relatively few gene rearrangements and more typically results in the 
integration of low numbers of gene copies into the plant chromosome. 

Monocotyledons are not a natural host of Agrobacterium. Although 
Agrobacterium-mediated transformation has been reported for asparagus (Bytebier et al 
n9R7 ^ Proc. Natl Acad. Sci. USA 84:5354-5349) and for Dioscore bublifera (Schafer et al 
(1987) Nature 327:529-532), it was generally beheved that plants in the family Gramineae 
could not be transfonned with Agrobacterium (Potrykus L (1987) Biotechnology 8:535-543). 
However, recently in U.S. Patent No. 5,981,840, Zhao et al disclosed agrobacterium- 
mediated transformation in maize. The method of Zhao et al includes the following steps: 
contacting at least one immature embryo from a maize plant with Agrobacterium capable of 
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transferring at least one gene to said embryo; co-cultivating the embryos with Agrobacterium; 
culturing the embryos in medium comprising N6 salts, an" antibiotic capable of inhibiting the 
growth of Agrobacterium, and a selective agent to select for embryos expressing the gene; 
and regenerating plants expressing the gene. 

5 

B. Microprojectile-Mediated Transformation 

In a microprojectile bombardment process, also referred to as a biolistic process, the 
transport of the DNA is mediated by very small particles of a biologically inert material. 
When the inert particles are coated with DNA and accelerated to a suitable velocity, one or 

1 0 more of the particles is able to enter into one or more of the cells where the DNA is released 

from the particle and expressed within the cell While some of the cells are fatally damaged 
by the bombardment process, some of the recipient cells do survive, stably retain the 
introduced DNA, and express it. Sanford et al provides a general description of a suitable 
particle bombardment instmment (Sanford et al (1987) Particulate Sci. Technol . 5: 27-37). 

1 5 Microprojectile bombardment process has been used to successfully introduce genes 

encoding new genetic traits into a nimiber of plants, including onion, cotton, maize, tobacco, 
rice, wheat, sunflowers, soybeans and certain vegetables (U.S. Pat. No. 4,945,050; Sanford 
et al (1988) Trends in Biotechnologv 6:299; Sanford et al (1988) Part. Sci. Technol. 5:27; 
J. J. Finer and M. D. McMullen (1990) Plant Cell Reports 8:586-589; and Gordon-Kamm 

20 (1990) The Plant Cell 2:603; Klein et al (1988) Proc. Nat. Acad. Sci. USA 85:4305-4309). 

Although transformation by microprojectile bombardment is less species and genotype 
specific than transformation with Agrobacterium, the frequencies of stable transformation 
events achieved following bombardment can be quite low, partly due to the absence of a 
natural mechanism for mediating the integration of a DNA molecule or gene responsible for 

25 a desired phenotypic trait into the genomic DNA of a plant. Particle gun transformation of 
cotton for example, has been reported to produce no more than one clonal transgenic plant 
per 100-500 meristems targeted for transformation. Only 0.1 to 1% of these transformants 
were capable of transmitting foreign DNA to progeny (WO 92/15675). Cells treated by 
particle bombardment must be regenerated into whole plants, which requires labor intensive, 

30 sterile tissue culture procedures and is generally genotype dependent in most crop plants, 

particularly so in cotton. Similar low transformation frequencies have been reported for other 
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plant species as well. Other disadvantages of microprojectile bombardment include the 
inability to control the site of woiinding of a plant tissue and thus, the site to which the 
transforaiing agent is delivered. The inability to target germline tissues accounts in part for 
the low transformation efficiencies achieved by microprojectile bombardment. Additionally, 
5 bombardment frequently results in the delivery of more than one copy of the transforming 
DNA or gene into the genome of the transfonned plant cell, which can have deleterious 
effects on the regenerated, transformed plant. Fragmentation of the DNA to be inserted can 
also occur when transforming DNA via microprojectile bombardment, resulting in transgenic 
plants with only a portion of the gene that is being inserted. 
1 0 Attempts have been made to improve the efficiency of microprojectile bombardment. 

For example, EPA 0486 233 describes treating bombarded tissues with Agrobacterium 
carrying the gene of interest. It is thought that the high velocity impact of the dense 
microprojectile particles generates an array of microwounds creating an environment that is 
particularly conducive to infection by the Agrobacterium. However, the transformed plant 
15 cells must still be regenerated into whole plants, and the fertile, stably transfonned plants 
must be selected from the total popxilation of regenerated plants. Organogenesis and somatic 
embryogenesis have been used to regenerate plants. Nonetheless, organogenesis frequently 
produces chimeric plant containing both transformed and nontransformed cells, and somatic 
embryogenesis, although superior to organogenesis is highly genotype dependent in most 
20 crop plants. 

Efforts have been made to deliver the transforming agent or DNA to germline tissues 
such that the agent or DNA will be incorporated directly into the DNA of the cells in these 
tissues, particularly into the DNA of the egg cells of the plant. In U.S. Patent No. 5,994,624, 
Trolinder et ai describes a method of implanta transformation which provides an improved 
25 method for delivering transforming agents to plant tissues. The method uses a 
needleless-injection device that is capable of injecting a small high pressure stream of a 
solution through the many cell layers of plant tissue. The transforming agent is delivered to 
a plant's floral tissues, thereby facilitating delivery of a transfonning agent comprising a gene 
of interest into germline cells of the plant. The high pressure stream provided by the injection 
30 device insures that the Agrobacterium culture or the DNA solution penetrates the many cell 
layers of the plant floral tissue without causing massive tissue damage, such as that caused 
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by direct injection with a syringe having a needle or by particle bombardment. The method 
can be used to transform plant cells and tissues, including embryonic tissue culture cells, 
meristematic tissues and plant callus, which can be regenerated into whole plants. Moreover, 
the method can be used to transform plant cells and tissues selected from the group consisting 

5 of cotton, soybean, alfalfa, flax, tobacco, sunflower, peanut, strawberry, tomato, pea, bean, 
squash, pepper, maize, sorghum, barley, oat, rye, wheat, rice, brassica, and potato. 

Although Klein et al (Klein et al (1988) Proc. Nat Acad. Sci. USA 85:4305^309; 
Klein et al (1988) Bio/Technol . 6:59-563; Klein et al (1989) Plant Phvsiol . 91:440-444) 
provides protocols for bombardment of maize non-regenerable suspension culture cells, no 

1 0 protocols have been published for the bombardment of callus cultures or regenerable maize 
cells until recently. Lundquist et al (U.S. Patent No. 6,013,863) describes delivery of DNA 
into regenerable maize callus cultures via particle bombardment process which results in high 
level of viability for a few transformed cells. The method maybe appHcable to producing 
fertile stably transgenic plants of other graminaceous cereals. Dwight et al (U.S. Patent No. 

1 5 5,990,387) discloses a method of producing fertile, stably transformed, Zea mays plant. The 
methods comprise the following steps: providing a foreign DNA comprising an expression 
vector carrying a gene encoding an agronomic trait; providing a maize embryogenic callus, 
suspension culture, or immature embryo isolated from a plant; introducing the foreign DNA 
into the embryogenic callus, suspension culture or immature embryo isolated from a plant by 

20 one or more microparticle bombardments; and regenerating fertile transgenic Zea mays plant. 

Plants that can be successfiilly transformed by the method of Dwight et al include maize, rye, 
barley, wheat, sorghum, oats, millet, rice, sunflower, alfalfa, rape seed and soybean.. 

Biswas et al describes generation of transgenic rice plants by microprojectile 
bombardment of embryogenic cell clusters (Biswas etal{l 998) Plant Science . 1 33 :203-2 1 0), 

25 and Yao et al discloses the production of transgenic barley plants via direct dehvery of 
plasmid DNA into microspores of barley using high velocity microprojectiles (Y ao et al 
(1997) Genome. 40:570-581). Christou et al reports on the parameters that influence stable 
transformation of rice embryogenic callus and the recovery of transgenic plants using electric 
discharge particle acceleration (Christou et al (1995) Annals of Botany 75:407-413). 
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C. Alternative Methods of Transformation 

Other methods for physical delivery of DNA to plants include sonication of target 
cells (Zhang et al (1991) Bio/Technologv 9:996) and liposome or spheroplast fusion 
(Deshayes etal (1985)EliffiaL,4: 2731, Christou e/a/, ri987^ ProcNatl. Acad Sci. USA 
84: 3962). Direct uptake of DNA into protoplasts using CaCl j precipitation, polyvinyl 
alcohol or poly-L-omithine has also been reported (Hain et al (1985) Mol. Gen. Genet . 199: 
161 and Draper et al (1982) Plant Cell Phvsiol. 23: 451). Nobre et al reports the 
regeneration of fertile transgenic plants of barley using PEG-mediated transformation of 
scutellum protoplast (Nobre et al (1997) Barley Genetics Newsletter . 27:16-17). 
Electroporation of protoplasts and whole cells and tissues has also been described (Donn et 
al Abstracts of VUth Intemational Congress on Plant Cell and Tissue Culture lAPTC . 

A2-38, p 53; D'Halluin et al (1992) Plant Cell 4: 1495-1505 and Spencer et al (1994) Plant 
Mol. Biol . 24: 51-61). In fact, D'Halluin etal (U.S. Patent No. 6,002,070) describes a rapid 
and efiHcient method of transforming monocotyledonous plants by electroporation. The 
method of D*Halluin comprises electroporation of DNA of interest into either intact tissue 
capable of forming compact embryogenic callus or compact embryogenic callus obtained 
from intact tissue. 

Another technology for production of transgenic plants is whisker-mediated 
transformation whereby certain materials, when incubated with plant tissue, facilitate entry 
of DNA molecules into plant cells. It has been proposed that such materials that promote 
DNA uptake, primarily silicone carbide, do so by damaging the cell surface. For a review, 
see Wang et al (1995) In Vitro Cell. Dev. Biol. 34: 101-4. 

Vn. Transgenes 

Genes successfully introduced into plants using recombinant DNA methodologies 
include, but are not limited to, those coding for the following traits: seed storage proteins, 
including modified 7S legume seed storage proteins (U.S. Patent Nos. 5,508,468, 5,559,223 
and 5,576,203); herbicide tolerance or resistance (U.S. Patent Nos. 5,498,544 and 5,554,798; 
Powell et al (1986) Science 232:738-743; Kaniewski et al (1990) Bio/Tech. 8:750-754; Day 
etal (19911 Proc. Natl. Acad. Sci. USA 88:6721-6725V phytase (U.S. Patent No. 5,593,963); 
resistance to bacterial, fungal, nematode and insect pests, including resistance to the 
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lepidoptera insects conferred by the Bt gene (U.S. Patent Nos. 5,597,945 and 5,597,946; 
Hilder et al Nature 330:160-163; Johnson et al (1989) Proc. Natl. Acad. Sci. USA 86:987 1- 
9875; Perlak et al (1990) Bio/rech . 8:939-943); lectins (U.S. Patent No. 5,276,269); and 
flower color (Meyer et al (1987) Nature 330:677-678; Napoli et a/.(1990) ElaDUCsU 2:279- 
5 289 (1990); van der Krol et al (1990) Plant Cell 2:291-299). 

vm. Expression Units to Express Exogenous DNA in a Plant 

The present invention further provides host cells transformed with a nucleic acid 
molecule that encodes a protein of the present invention. The host cell can be either 

1 0 prokaryotic or eukaryotic. Exikaryotic cells useful for expression of a protein of the invention 
are not limited so long as the cell line is compatible with cell culture methods and compatible 
with the propagation of the expression vector and expression of the gene product. Preferred 
eukaryotic host cells include any plant species. 

Any prokaryotic host can be used to express a rDN A molecule encoding a protein 

1 5 of the invention. The preferred prokaryotic host is E. coli. 

Transformation of appropriate cell hosts with a rDNA molecule of the present 
invention is accomplished by well known methods that typically depend on the type of vector 
used and host system employed. With regard to transformation of prokaryotic host cells, 
electroporation and salt treatment methods are typically employed, see, for example, Cohen 

20 et al Proc. Nad. Acad. Sci. USA 69:21 10-21 14; and Maniatis et al (1982) Molecular 

Cloning - A Laboratorv Manual. Cold Spring Harbor Laboratory Press. With regard to 
transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et al (1973) 
Virologv 52:456^67; and Wigler et al (1979) Proc. Natl. Acad. Sci. USA 76:1373-1376. 

25 Successfiilly transformed cells, cells that contain a rDNA molecule of the present 

invention, can be identified by well known techniques including the selection for a selectable 
marker. For example, cells resulting fi'om the introduction of an rDNA of the present 
invention can be cloned to produce single colonies. Cells firom those colonies can be 
harvested, lysed and their DNA content examined for the presence of the rDNA using a 

30 method such as that described by Southem, (1975) J. Mol. Biol. 98:503-517; or Berent et al 



wo 01/64928 



PCT/USOl/06622 



33 

(1985) Biotech. Histochem. 3:208; or the proteins produced from the cell assayed via an 
immunological method. 

As pn)vided herein elsewhere, several embodiments of the present invention employ 
expression units (or expression vectors or systems) to express an exogenously supplied 
5 ." nucleic acid sequence, such as the sequence coding for SH2-REV6, SH2HS33, and SH2- 
REV6-HS protein in a plant. Methods for generating expression units/systems/vectors for use 
in plants are well known in the art and can readily be adapted for use in expressing 
polynucleotide sequences encoding proteins of the present invention, such as SH2-REV6, 
SH2HS33, and SH2-REV6-HS proteins, in a plant cell. A skilled artisan can readily use any 
10 appropriate plant/vector/expression system in the present methods following the outiine 
provided herein. 

The expression control elements used to regulate the expression of the protein can 
either be the expression control element that is normally found associated with the coding 
sequence (homologous expression element) or can be a heterologous expression control 

15 . element. A variety of homologous and heterologous expression control elements are known 
in the art and can readily be used to make expression units for use in the present invention. 
Transcription initiation regions, for example, can include any of the various opine initiation 
regions, such as octopine, mannopine, nopaline and the like that are found in the Ti plasmids 
of Agrobacterium tumefaciens. Alternatively, plant viral promoters can also be used, such 

20 as thg^ cauliflower mosaic virus 35S promoter to control gene expression in a plant. Lastly, 
plant promoters such as prolifera promoter, fruit-specific promoters, Ap3 promoter, heat 
shock promoters, seed-specific promoters, etc, can also be used. The most preferred 
promoters will be most active in seedlings. 

Either a constitutive promoter (such as the CaMV or Nos promoter), an oigan-specific 

25 promoter (such as the E8 promoter from tomato) or an inducible promoter is typically ligated 
to the protein or antisense encoding region using standard techniques known in the art. The 
expression unit may be fiirther optimized by employing supplemental elements such as 
transcription terminators and/or enhancer elements. 

Thus, for expression in plants, the expression units will typically contain, in addition 

30 to the protein sequence, a plant promoter region, a transcription initiation site and a 
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transcription termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of 
the expression unit are typically included to allow for easy insertion into a preexisting vector. 

In the construction of heterologous promoter/structural gene or antisense 
combinations, the promoter is preferably positioned about the same distance firom the 
heterologous transcription start site as it is from the transcription start site in its natural 
setting. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 

In addition to a promoter sequence, the expression cassette can also contain a 
transcription termination region downstream of the stmctural gene to provide for efficient 
termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. If the mRNA encoded by the structural 
gene is to be efficiently processed, DNA sequences which direct polyadenylation of the RNA 
are also commonly added to the vector construct. Polyadenylation sequences include, but are 
not limited to the Agrobacterium octopine synthase signal (Gielen et al (1984) EMBO J 
3:835-846) or the nopaline synthase signal (Depicker et al (1982) Mol. and AppL Genet 1 : 
561-573). 

The resulting expression unit is ligated into or otherwise constructed to be included 
in a vector which is ^ropriate for higher plant transformation. The vector will also typically 
contain a selectable marker gene by which transformed plant cells can be idratified in culture. 
Usually, the marker gene will encode antibiotic resistance. These markers include resistance 
to 041 8, hygromycin, bleomycin, kanamycin, and gentarrucin. After transforming the plant 
cells, those cells having the vector will be identified by their ability to grow on a medium 
containing the particular antibiotic. Replication sequences, of bacterial or viral origin, are 
generally also included to allow the vector to be cloned in a bacterial or phage host, 
preferably a broad host range prokaryotic origin of replication is included. A selectable 
marker for bacteria should also be included to allow selection of bacterial cells bearing the 
desired construct. Suitable prokaryotic selectable markers also include resistance to 
antibiotics such as kanamycin or tetracycline. 

Other DNA sequences encoding additional functions may also be present in the 
vector, as is known in the art. For instance, in the case o{ Agrobacterium transformations, 
T-DN A sequences will also be included for subsequent transfer to plant chromosomes. 
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The polynucleotide sequences of the subject invention, such as the Sh2-Rev6, 
Sh2hs33, and Sh2-Rev6'HS sequences, utilized in the present invention can also be fused to 
various other nucleic acid molecules such as Expressed Sequence Tags (ESTs), epitopes or 
fluorescent protein markers. 

5 ESTs are gene fragments, typically 300 to 400 nucleotides in lengtii, sequenced from 

the 3' or 5' end of complementary-DNA (cDNA) clones. Nearly 30,000 Arabidopsis thaliana 
ESTs have been produced by a French and an American consortium (Delseny et al (1 997) 
FEES Lett. 405(2): 129-132; Arabidopsis thaliana Database, 
http://genome.www.stanford.edu/Arabidopsis). For a discussion of the analysis of gene- 

10 expression patterns derived fit»m large EST databases, see, e.g., M. R. Fannon (1996) 
TIBTECH 14:294-298. 

Biologically compatible fluorescent protein probes, particularly the self-assembling 
green fluorescent protein (GFP) from the jellyfish Aequorea victoria, have revolutionized 
research in cell, molecular and developmental biology because they allow visualization of 

1 5 biochemical events in living cells (Murphy et al ( 1 997) Curr. Biol . 7(1 1 ):870-876; Grebenok 
et al. (1997) Plant J . 1 1(3):573-586; Pang et al, (1996) Plant Phvsiol. 1 12(3); Chiu et al 
(1996) Cuir. Biol . 6(3):325-330; Plautz et al, (1996) Qaie 1 73(l):83-87; Sheen et al (1995) 
Plant J. 8(5):777-784). 

Site-directed mutatgenesis has been used to develop a more soluble version of the 

20 codon-modified GFP call soluble-modified GFP (smGFP). When introduced into 
Arabidopsis, greater fluorescence was observed when compared to the codon-modified GFP, 
implying that smGFP is 'brighter' because more of it is present in a soluble and functional 
form (Davis et al (1998) Plant MoL Biol . 36(4):52 1-528). By fusing genes encoding GFP 
and beta-glucuronidase (GUS), researchers were able to create a set of bifunctional reporter 

25 constructs which are optimized for use in transient and stable expression systems in plants, 
including Arabidopsis (QuaedvUeg et al (1998) Plant MoL Biol. 37(4):7 15-727). 

Berger et al (Berg et al (1998) Dev. Biol . 194(2):226-234) report the isolation of a 
GFP marker line for Arabidopsis hypocotyl epidermal cells. GFP-fusion proteins have been 
used to localize and characterize a number of Arabidopsis genes, including geranylgeranyl 

30 pyrophosphate (GGPP)(Zhu era/. (1997;^ Plant Mol. Biol . 35(3):331-341). 
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DC. Rreedin^ Methods 

Open«Pollinated Populations . The improvement of open-pollinated populations of 
such crops as rye, many maizes and sugar beets, heibage grasses, legumes such as alfalfa and 
clover, and tropical tree crops such as cacao, coconuts, oil palm and some rubber, depends 

5 essentially upon changing gene-frequencies towards fixation of favorable alleles while 

maintaining a high (but far from maximal) degree of heterozygosity. Uniformity in such 
populations is impossible and tmeness-to-type in an open-pollinated variety is a statistical 
feature of the population as a whole, not a characteristic of individual plants. Thus, the 
heterogeneity of open-pollinated populations contrasts with the homogeneity (or virtually so) 

1 0 of inbred lines, clones and hybrids. 

Population improvement methods fall naturally into two groups, those based on 
purely phenotypic selection, nomially called mass selection, and those based on selection 
with progeny testing. Interpopulation improvement utilizes the concept of open breeding 
populations; allowing genes for flow from one population to another. Plants in one 

15 population (cultivar, strain, ecotype, or any germplasm source) are crossed either naturally 
by wind) or by hand or by bees (commonly Apis mellifera L. or Megachile rotundata 
F.) with plants fxom other populations. Selection is applied to improve one (or sometimes 
both) population(s) by isolating plants with desirable traits from both sources. 

There are basically two primary methods of open-pollinated population improvement. 

20 First, there is the situation in which a population is changed en masse by a chosen selection 
procedure. The outcome is an improved population which is indefinitely propagable by 
random-mating within itself in isolation. Second, the synthetic variety attains the same end 
result as population improvement but is not itself propagable as such; it has to be 
reconstructed from parental lines or clones. These plant breeding procedures for improving 

25 open-pollinated populations are well known to those skilled in the art and comprehensive 
reviews of breeding procedures routinely used for improving cross-pollinated plants are 
provided in numerous texts and articles, including: Allard, (1960) Principles of Plant 
Breeding . John Wiley & Sons, Lie; Simmonds (\919^ Principles of Crop Improvement . 
Longman Group Limited; Hallauer and Miranda (1981) Quantitative Genetics in Maize 

30 Breeding . Iowa State University Press; and, Jensen (1988) Plant Breeding Methodology . John 

Wiley & Sons, Inc. 
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Mass Selection . In mass selection, desirable individual plants are chosen, harvested, 
and the seed composited without progeny testing to produce the following generation. Smce 
selection is based on the maternal parent only, and their is no control over pollination, mass 
selection amounts to a forni of random mating with selection. As stated above, the purpose 
of mass selection is to increase the proportion of superior genotypes in the population. 

Synthetics . A synthetic variety is produced by crossing inter se a number of 
genotypes selected for good combining ability in all possible hybrid combinations, with 
subsequent maintenance of the variety by open pollination. Whether parents are (more or less 
inbred) seed-propagated lines, as in soine sugar beet and beans (Vicia) or clones, as in 
herbage grasses, clovers and alfalfe, makes no difference in principle. Parents are selected 
on general combining ability, sometimes by test crosses or topcrosses, more generally by 
polycrosses. Parental seed lines maybe deliberately inbred (eg., by selfing or sib crossing). 
However, even if the parents are not deliberately inbred, selection within lines during line 
maintenance will ensure that some inbreeding occurs. Clonal parents will, of course, remain 
unchanged and highly heterozygous. 

Whether a synthetic can go straight from the parental seed production plot to the 
famier or must first undergo one or two cycles of multiplication depends on seed production 
and the scale of demand for seed. In practice, grasses and clovers are generally multiplied 
once or twice and are thus considerably rdnoved fi:om the original synthetic. 

While mass selection is sometimes used, progeny testing is generally preferred for 
polycrosses, because of their operational simplicity and obvious relevance to the objective, 
namely exploitation of general combining ability in a synthetic. 

The number of parental Unes or clones that enter a synthetic vary widely. In practice, 
numbers of parental lines range fix)m 10 to several hundred, with 100-200 being the average. 
Broad based synthetics formed from 100 or more clones would be expected to be more stable 
during seed multiplication than narrow based synthetics. 

Hvbrids . A hybrid is an individual plant resuking from a cross between parents of 
differing genotypes. Commercial hybrids are now used extensively in many crops, including 
com (maize), sorghum, sugarbeet, sunflower and broccoli. Hybrids can also be produced in 
wheat and rice. Hybrids can be formed a number of different ways, including by crossing two 
parents directly (single cross hybrids), by crossing a single cross hybrid with another parent 
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(three-way or triple cross hybrids), or by crossing two different hybrids (four-way or double 
cross hybrids). 

Strictly speaking, most individuals in an outbreeding (i.e,, open-pollinated) population 
are hybrids, but the term is usually reserved for cases in which the parents are individuals 

5 whose genomes are suflFiciently distinct for them to be recognized as different species or 
subspecies. Hybrids may be fertile or sterile depending on qualitative and/or quantitative 
differences in the genomes of the two parents. Heterosis, or hybrid vigor, is usually 
associated with increased heterozygosity which results in increased vigor of growth, survival, 
and fertility of hybrids as compared with the parental lines which were used to form the 

1 0 hybrid. Maximum heterosis is usually achieved by crossing two genetically different, highly 
inbred lines. 

The production of hybrids is a well-developed industry, involving the isolated 
production of both the parental lines and the hybrids which result from crossing those lines. 
For a detailed discussion of the hybrid production process, see, e.g., Wright, Commercial 
15 Hybrid Seed Production 8:161-176> In Hybridization of Corp Plants , supra. 

X. Seed Number. Grain Yield, and Sink Capacity in Wheat 

Wheat seed number and subsequent grain yield is affected by competition between 
inflorescences (Whingwiri et ai, 1981). Wheat yield is always lower than ears potential due 

20 to lack of assimilate supply or competition among florets limiting seed size and/or number 

(Zamski and Grunberger, 1995). Healthy, well-grown wheat plants always produce more 
shoots (potential heads) and florets (potential seeds) than heads and seeds. A significant 
factor controlling seed number is sink strength of the developing seeds (Thome and Wood, 
1987). A review of this area (Evans et ai, 1975) indicates for wheat in many cases, yield is 

25 limited by the sink capacity of developing seeds. The limitations imparted by low sink 
strength may be seen as reduced grain set, reduced number of wheat heads, and reduced 
individual seed weight. In wheat, it is generally believed that the rate of assimilate flow to 
developing heads determines the survival of initiated florets and plays a significant role in 
determining final grain number (Spiertz and vanKeulen, 1980; Abbate et al,^ 1998). Possibly 

30 the most effective method of increasing kernel number in wheat would be to modify 
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assimilate flow to developing kernels (Bindraban et a/., 1998). The transgenic wheat of the 
present invention which contains increased sink strength confirms this hypothesis. 

All patents, patent applications, provisional applications, and publications referred to 
5 or cited herein are incorporated by reference in their entirety to the extent they are not 
inconsistent with the explicit teachings of this specification. 

Materials and Methods 
I. Production of Transgenic Plants 
10 Vectors according to the invention may be used to transform plants as desired, to 

make plants according to the invention as discussed elsewhere herein. 

Wheat Transformation . The methods described by Weeks et al (1993) and Vasil et 
al (1993) have been adopted with minor modifications for transforming the wheat cultivar 
*Hi-Line' (Lanning et al, 1992). The technique as routinely practiced initially utilizes 
1 5 immature embryos isolated from wheat cultivars approximately 7 days post anthesis. 

The BioUstic PDS-1000 He (Bio-Rad laboratories, USA) device was used for 
transforming the wheat tissues via microprojectile bombardment. 

For wheat calli 1500 psi rupture discs were used. Other procedures such as 
sterilization of the rupture discs, macrocaniers, stopping screens etc, were strictly in 
20 accordance with the manufacturer's manual. 

Rice Transformation . The methods described by Sivamani et al. (1996) may be 
adopted for transforming rice cultivar 'M202' (Johnson et al 1986). The technique as 
routinely practiced initially utilizes embryogenic calli cultured fi-om mature seeds. 

The Biolistic PDS-1000 He (Bio-Rad laboratories, USA) device is used for 
25 transfonning the rice tissues via microprojectile bombardment. 

For rice caUi 1 500 psi rupture discs are used. Other procedures such as sterilization 
of the mpture discs, macrocarriers, stopping screens e/c, are strictly in accordance with the 
manufacturer's manual. 

Pea TransfoTmation . The methods described by U.S. Patent No. 5,286,635 (Example 
30 9) and U.S. Patent No. 5,773,693 (Example V) may be adopted with minor modifications for 

transforming the pea {Pisum sativum L) cultivar Tea Green Arrow' (available commercially 
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from Park Seed®). Pea explant material is transfomied by incubation with Agrobacterium 
cells carrying Sh2'Rev6-HS sequence. The pea explant is preferably obtained from the 
plumule of a pea seed, and transformed shoots are preferably induced directly in the explant 
material without passage through a callus phase. Whole transformed pea plants may be 
5 regenerated from the transformed shoots by rooting and subsequent planting in the soil. The 
exogenous Shl-RevS-HS DNA will be stably incorporated into the chromosomes of the 
regenerated 'Pea Green Arrow' plant which will be able to express the gene. 

n. Plasmids 

1 0 Wheat . The plasmid DNA pRQ 1 0 1 containing the coding sequence of the Bar gene 

(Fromm et ai, 1990) under control of the CaMV 35S promoter with AdhI intron and NOS 
terminator was used as selectable marker for selecting transgenic wheat tissue. 

Rice . As a selectable marker for rice, the plasmid DNA pILTAB222 containing the 
coding sequence of the hygromycin B phosphotransferase under the control of the maize 

1 5 ubiquitin promoter was used (Sivamani et al, 1 996), 

Pea . As a selectable marker for pea, the coding sequence of cefotoxime resistance 
may be used according to U.S. Patent No. 5,773,693. This anti-Agrobacterium antibiotic may 
be used in the selection and regeneration medium (500 mg/1) used for growing the pea callus. 
General . The marker genes (i.e.. Bar, hygromycin resistance, or cefotaxime) were on 

20 diflFerent construct than S/i2-i?ev5-//5 genes. 

For the introduction of the Sh2'Rev6'HS genes into cereals, plasmid pSh2-Rev6-HS 
were created. Besides containing Shl-RevS-HS cDNA, the plasmid also contained the Sh2 
promoter, Shi first intron, and NOS terminator (Rogers et aL, 1987). Specifically, plasmid 
pSh2-Rev6-HS contains the following nucleotide fragments linked in the 5' to 3* direction: 

25 nucleotides -1084 to +36 of the .Sh2 promoter; 8 nucleotides of polylinker; two C's; 
nucleotides of the Shi intron 1 cassette containing nucleotides +43 to +52 of Shi exon 1, 
nucleotides +53 to +1080 of Shi intron 1 and nucleotides +1081 to 1097 of Shi exon 2; one 
C; 13 nucleotides of polylinker containing a BamHl restriction site; cDNA encoding Sh2- 
Rev6-HS (SEQ ID NO: 3); 18 nucleotides polylinker containing Kpnl and Sstl restriction 

30 sites; and nucleic acid of the NOS terminator. The nucleic acid sequence of the Sh2 promoter 
is disclosed by Shaw and Hannah, (1992), Plant Physiology . 98:1214-1216. The sequence 
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numbering of Shi intron cassette is shown in Zack et ai (1 986) Mavdica . 31,5-16, and the 
effects of the Shi intron 1 cassette on transient gene expression are described by Clancy et 
ai ((1994) Plant Science . 98, 151-161) and Vasil et ai ((1989), Plant Science . 91, 1575- 
1579). The 3 additional C's (2 at the 5* end and 1 at the 3' end) are subcloning derived 
nucleotides. The plasmid includes transit peptide and consensus start site. Plasmid pSh2- 
Rev6-HS used in the present application is provided by Florida State University. 

For introduction of Shl-Rev-HS into dicots such as pea, the above plasmid is 
modified such that the Sh2 promoter is replaced w^ith a dicot seed specific promoter such as 
pea vicilin promoter (U.S. Patent No. 5,773,693). Other suitable promoters and/or constmcts 
for expression of Sh2-Rev6-HS in dicots are well known to the skilled artisan (see, e.g., U.S. 
Patent No. 5,773,693). 

m. Selection and Regeneration of Transgenic Plants 

Wheat . Transgenic wheat plants were obtained fi"om bombarded inunature embryos 
by the methods described by Weeks et ai (1993) and Vasil et ai (1993) using bialaphos 
(Meiji Seika Kaisha Ltd, Japan) selection. The resistant calli of wheat are transferred to 
medium to induce production of both shoots and roots. 

Rice . Transgenic rice plants were obtained fi-om the bombarded embryogenic calli 
of rice by the technique of Sivamani et ai (1996) using hygromycin selection. The resistant 
calli of rice are transfen^d to mediimi to induce production of both shoots and roots. 

Peas. Transgenic pea plants may be obtained fijom Agrobacterium-transformed calli 
of pea explants by the method of U.S. Patent No. 5,773,693 using cefotaxime selection. 

Pea shoots may be rooted by transfer to Sori3arod plugs (Baumgartnen Papiers SA, 
Switzeriand) and soaked in liquid YRM according to U.S. Patent No. 5,773,693 (Example 
V). 

General . Putative transgenic plantlets were transferred to the greenhouse and allowed 
to self-fertilize. For wheat, typically more than 75% of these plantlets are escapes and true 
transgenic plants were selected by spraying the plants with 0.1% glufosinate (Liberty®, 
Agrevo Inc.). 
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IV. Primers for PGR 

An Sh2 specific primer and a NOS specific primer for PGR were used to confirm the 
presence of Sh2-Rev&HS transgene in the transgenic plants. The 5' primer was MG4Sh2, a 
26-mer which is specific to Sh2 sequences in the construct: 

5' GTG GAT GTG AAG TGA AGG ACT CCG TG 3' (SEQ ID NO: 5). 

The 3' primer was MC35PUC19, a 24-mer specific to the puc backbone of the 
construct: 

5' GGC TTA ACT ATG CGG CAT CAG AGC 3' (SEQ JD NO: 6). 
The primers produce a PGR product of 826 bp (309 bp o{Sh2 cDNA, 260 bp of 
NOS,and257bppUC19). 

Following are examples which illustrate procedures for practicing the invention. 
These examples should not be construed as limiting. All percentages are by weight and all 
solvent mixture proportions are by volume unless otherwise noted. 

Example 1 — Genetic Analysis of Transgenic Wheat Plants 

The initial pool of wheat transformants yielded a number of independent 
transformants which wctc transgenic for Sh2-Rev6'HS and/or basta resistance. 

The Toplants were allowed to set seed and mature in the greenhouse under controlled 
conditions. 

The selected wheat transformants were analyzed by PCR for the presence of the 
introduced transgene and for T, seed segregation data for basta resistance. 

PCR screening of transgenic wheat plants utilized MC4Sh2 and MC35PUC19 
(primer sequences given above) for the presence of Sh2-R€v6'HS in genomic DNA samples 
prepared firom leaf tissue using standard PCR protocols. 

Twenty seven independent lines of transgenic wheat were tested. AH 27 transgenic 
lines tested positive for basta resistance. Fifteen of the 27 transgenic lines tested positive for 
the presence of the Sh2-Rev6'HS transgene and the other 12 did not test positive for the 
presence of the Sh2'Rev6-HS transgene. 
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Kxample 2— Phenotvpic Analysis of the Transgenic Wheat Plants 

Various phenotypic traits were collected and analyzed for each of the 27 transgenic 
wheat plants grown in a greenhouse. As mentioned previously, all 27 transgenic lines carried 
the hert)icide resistance gene. These traits included the following: number of seeds per plant 
(Seeds/Plant); individual seed weight (hidividual Seed Wt.) in milligrams per kernel 
(mg/kemel); Harvest Index (Harvest Index); total seed weight (Total Seed Wt.) in grams per 
plant (g/plant); number of grain heads per plant (Heads); total plant weight (Plant Wt.) in 
grams per plant (g/plant); and flag leaf weight (Flag Leaf Wt.) in grams per plant (g/plant). 

Seeds were uniformly dried in a 37°C incubator to a moisture of between about 10% 
to about 14%. 

The above ground parts of the plants were harvested at time of maturity and 
uniformly dried to about 0% moisture in a 125 °C incubator. The dried plant weights and 
dried flag leaf weights were adjusted to reflect weights at the same moisture content as that 
of the seeds (z.e., about 10% to about 14%). Roots were not collected. 

Plant weight represents the total weight of the "above ground" plant parts not 
including the total seed weight of the plant and the flag leaf weight of the plant. 

Harvest Index (HI) was calculated as follows: 

HI = {(Total Seed Wt.)/(Total Seed Wt. + Plant Wt. + Flag Leaf Wt.)} . 

For the number of wheat heads per plant, the number of heads were counted without 
regard to whether or how many seeds were in any particular head, 

The phenotypic data were analyzed in several different ways, as discussed below. 

Comparison Between PCR -h and PGR- Lines. This comparison was made for all 
transgenic lines (15 lines) with positive PGR results (PCR+) for Sh2-Rev6'HS versus all 
transgenic lines (12 lines) with negative PGR results (PGR-) for Sh2'Rev6-HS. Thus, the 
PGR+ lines carry both the herbicide resistance gene and the Sh2-Rev6-HS gene while the 
PGR- lines only carry the herbicide resistance genes. The results are presented in Table 1. 
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Table 1. Comparison Between PCR+ and PCR- Lines 


Number 




Seeds/ 


irKJivrauai 


Tr»f?st 

1 Olol 


narvesi 


neaos 


Dion* lA/t 

r^iani vvi. 


Plan 

nag 


■a rsf 
Jr OT 


and Types 




Plants 


Seed Wt. 


Seed Wt. 


Index 






Leaf 


Plants 


of Lines 
















Wt. 










(mg/kemei) 


(g/plant) 




# 


(g/plant) 


(g/plant) 




15 PGR + 


Avg 


63.68" 


25^- 


1.73" 


0.30** 


5.71 


3.74 


0.40 


183 




Std 


10.59 


1.80 


0.34 


0.08 


0.86 


2.38 


0.20 




12PCR- 


Avg 


' 53.50 


24.10 


1.37 


0.25 


5.82 


3.64 


0.41 


148 




Std 


9.6 


1.50 


0.26 


0.10 


0.81 


1.74 


0.18 




PCT+/PCR- 




1.19* 


1.05— 


1.26" 


1.17** 


0.98 


1.03 


0.97 





indicate p values of less than or equal to: 0.05, 0.01 . or 0.001 , respectively, based on a t test. 



10 

Comparison Between SH2+ and SH2- . In the second comparison, only the 8 
transgenic lines with positive PCR results for Sh2-Rev6'HS which also displayed increases 
in the levels of the introduced protein (SH2+) were averaged and compared with all other 
lines (SH2-). The 8 PCR+ lines which are SH2+ are the lines for which increased levels of 
1 5 the introduced protein were detected. Basically, the SH2 levels were compared to those of 
the lines which were transgenic for only the herbicide resistance gene. Those experimental 
plants which produced 25% or more of the SH2 protein as compared to the production of 
SH2 by the lines which were transgenic for only the herbicide resistance gene were 
designated as "SH2+". 

20 The SH2+ lines were compared to the other 19 lines ("SH2-") which lack any 

significant expression of the introduced protein. Thus, the 1 9 SIC- lines includes the 7 PCR-f- 
lines which did not express significant levels of the SH2-REV6-HS protein and the 12 PCR- 
lines which did not express SH2-REV6-HS at all. The data is presented in Table n. 



Table II. Comparison Between SH2+ and SH- 


Number 




Seeds/ 


Individual 


Total 


Harvest 


Heads 


Plant Wt. 


Flag Leaf 


#0f 


and Types 




Plants 


Seed Wt. 


Seed 


Index 






Wt. 


Plants 


of Lines 








Wt. 


















(mg/kemel) 


(g/plant) 






(g/plant) 


(g/plant) 




8SH2+ 


Avg 


78.23*" 


26.9'** 


2.1 9*** 


0.32**' 


6.00 


4.29*** 


0.45"' 


100 




Std 


45.50 


4.10 


1.35 


0.10 


2.93 


2.29 


0.21 




19 SH2- 


Avg 


50.86 


23.80 


1.30 


0.25 


5.66 


3.44 


0.39 


231 




Std 


33-65 


4.70 


0.94 


0.13 


2.38 


1.99 


0.16 




SH2+/SH2- 




1.54— 


1.13*** 


1.68*** 


1.25"' 


1.06 


1.25— 


1.15" 





indicate p values of less than or equal to: 0.05. 0.01. or 0.001. respectively, based on a t-tests. 
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The data presented in Table n show that the total number of seeds per plant for the 
SH2+ lines increased about 54% in comparison to the total number of seeds per plant for the 
SH2- lines. Individual seed weight increased about 13% and the total seed yield increased 
about 68% for the SH2+ lines when compared to the SH2- lines. Harvest Index for the SH+ 
lines was about 25% greater than that for the SH2- lines. The SH2+ lines were also 
significantly bigger in total plant mass and in flag leaf wei^t (about +25% and about +1 5%, 
respectively). 

Comparison Between Lines Homozygous and Heterozvgous for Sh2-Rev6-HS Tl 
plants found to be homozygous by progeny testing the T2 seeds were designated as "Homoz 
SH2+". The seeds of Homoz SH2+ plants are expected to have a greater dosage of the 
transgene than the other lines. In this comparison, the Homoz SH2 plants were compared to 
SH2+ plants that were heterozygous (Heteroz SH2+) and were also compared to the 12 PCR- 
lines. 



15 



20 



25 



30 



Table III. Comparison Between Lines Homozygous 
and Heterozygous for Sh2-Rev6'HS 


Types of 
Unes 




Seeds/ 
Plants 


Individual 
Seed Wt. 


Tola) 
Seed 
Wt. 


Han«st 
Index 


Heads 


Plant 
Wt. 


Flag Leaf 
Wt. 


#of 
Plants 








(mg/kemel) 


(g/plant) 






(g/plant) 


(g/plant) 




Homoz 
SH2+ 


Avg 
Std 


97.r 

43.60 


27.60 
2.60 


2.74* 

1.31 


0.30 
0.03 


7.59** 
3.10 


5.76 
2.25 


0.56 
0.17 


22 


Heteroz 
SH2+ 


Avg 
Std 


74.50 
45.10 


26.80 
3.80 


2.10 
1.34 


0.33 
0.10 


5.58 
Z54 


3.91 
2.08 


0.42 
0.20 


66 


SH2+ 
Homoz/ 
Heteroz 




1.30* 


1.03 


1.30* 


^.05 


1.36'* 


1.47 


1.33 


22/66 


SH2 
Homoz/ 
12PCR- 




1.91*** 


1.16*** 


2.10*^ 


1.16* . 


1.34" 


1.67— 


1.43"* 


22/148 



' indicate p values of less than or equal to: 0.05. 0.01 . or 0.001. respectively, based on t tests. 



The majority (approximately two-thirds) of the plants analyzed were determined to 
be heterozygous for Sh2-Rev6-HS and, therefore, only have half of the possible dosage of the 
transgene coding for SH2-REV6-HS. 
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Example 6— AG P activity and Tl Seed Weight of SH2-REV6-HS Transgenic Lines 



Plant 

rianT 




Tl .<!eed Weisht 




inn 




IViJ 1 


1 77 


27 0 me 


RS2 


127 


27.5 mg 


RS3 


121 


23.8 mg 


RS4 


120 


27.1 mg 


RS6 


124 


25.6 mg 


RS8 


122 


21.9mg 


RSIO 


147 


27.7 mg 


RS17 


106 


20.0 mg 


RS20 


114 


. 23.8 mg 


RS21 


100 


22.7 mg 


RS22 


127 


21.5 rag 



AGP activity assays reflect a mean of three replicates performed usiiig an extract 
prepared from a minimum of 10 developing seeds. Activities are expressed relative to the 
average value obtained for varietal control plant M202. Tl seed weights are averages of a 
random subsample of mature Tl seeds harvested from individual TO transgenic lines. 

At the AGP activity level, the majority of the Sh2'Rev6-HS transgenic rice lines have 
significant increases relative to M202. Lines RSI 7 and RS21 do not have significant 
increases in AGP activity. Line RSIO exhibits the highest level of overexpression of all lines 
at the RNA level and also has highest extractable AGP activity. 

Example 7— RSI Tl Growth Chamber Yield Study 

Sixteen Tl plants (numbered 1, 3, 4, 5, 6, 7, 10, 13, 15, 17, 18, 19, 20, 22, 23, and 25, 
respectively) representing Sh2'Rev6'HS transgenic rice line RSI were grown in a growth 
chamber and compared with five M202 and 5 of control transgenic line 97-3 (the 97-3 line 
carries only hygromycin resistance). The sixteen RSI Tl plants and the 5 97-3 plants came 
from individual seeds germinated on petri plates using hygromycin selection and were then 
transplanted into soil. The 97-3 plants are homozygous for a hygromycin resistance gene 
locus and the RSI Tl plants are heterozygous (12 of 16) or homozygous (4 of 16) for the 
hyffomycm/Sh2'Rev6'HS transgene locus. The dosage of each RS 1 Tl plant was determined 
by progeny tests. RS 1 plants 1 0, 1 8, 1 9, and 20 are homozygous. Difficulty in establishing 
the M202 plants may be a consequence of their being direct seeded into soil. The results are 
shown in Table IV below. 
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To determine the effect of increased gene dosage, individual T, plants were 
determined to be homozygous or heterozygous by progeny testing the T, seeds harvested 
from the plants. Lack of segregation for the herbicide resistance marker gene was taken as 
evidence of homozygosity. A comparison of 22 SH2+ homozygous plants with heterozygous 
5 SH+ plants indicates that increasing the dosage of the Sh2'Rev6-HS transgene leads to even 
larger yield and plant growth increases over plants which do not contain or express the 
transgene. The results provided in Table EI indicate an approximately 1 10% increase in total 
seed weight per plant compared to SH2+ heterozygous plants. 

10 Example 3— Experiments with Rice 

The transgenic rice plants are produced as described in the Materials and Methods. 
The resultant rice plants are analyzed as set forth in Examples 1 and 2. 

Example A — Experiments with Pea 
1 5 The transgenic pea plants are produced as described in the Materials and Methods, 

The resultant pea plants are analyzed as set forth in Examples 1 and 2. 

Example 5— Northern Analysis of SH2>REV6-HS Transgenic Rice Lines 

Ten (10) or more developing seeds were harvested from individual TO transgenic rice 
20 lines. All TO transgenic lines were PCR positive for the ^W-Z^evd-Z/S transgene. RNAwas 

prepared and analyzed according to standard techniques. Duplicate blots were probed with 

a small AGP subunit probe (Brittle-2) or the Sh2'Rev6'HS transgene coding sequences. The 

genotype labeled M202 is a varietal control. 

As can be seen in Figure 1, RSI, RS4, RSIO, RS20, and RS22 transgenic plants 
25 express the Sh2-Rev6-HS transgene, in contrast to untransfora[ied M202 plant which does not 

express the transgene. Due to small differences in loading, minor differences in expression 

may or may not be due to the transgene. Significant differences in loading are not apparent 

in a duplicate blot probed with the Brittle-2 gene. 
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Table IV. 



Genotype 


Panicle # 


Total 


Total Seed 


Seed Wt./ 


Seeds/ 


Individual 


Harvest 






Seed# 


Wt. 


Panicle 


Panicle 


Seed Wt. 


Index 


M202 1 


16 


385 


9.94 


0.62 


24.06 


0.0258 


031 


2 


12 


409 


11,04 


0.92 


34.08 


0.0270 


0.39 


3 


15 


647 


17.11 


1.14 


43.13 


0.0264 


038 


4 


14 


413 


10.46 


0.75 


29.50 


0-0253 


0.39 


5 


24 


751 


20.07 


0.84 


31.29 


0.0267 


0.35 


avg 


16.2 


521 


13.724 


0.85 


32.41 


0.0263 


036 


std 


4.12 


14932 


4.10 


0.17 


6.28 


0.0006 


0.03 


97-3 1 


31 


789 


20.01 


0.65 


25.45 


0.0254 


0.35 


2 


32 


1040 


27 


0.84 


32.50 


0.0260 


0.38 


3 


28 


688 


17.85 


0.64 


24.57 


0.0259 


032 


4 


18 


623 


16.08 


0,89 


34,61 


0.0258 


0.37 


5 


32 


895 


23.57 


0.74 


27.97 


0.0263 


0.37 


avg 


28.2 


807 


20.902 


0.75 


29.02 


0.0259 


036 


std 


5J1 


148.58 


3.94 


0.10 


3.92 


0.0003 


0.02 


RSI 1 


33 


943 


24.35 


0.74 


28.58 


0.0258 


0.34 


3 


18 


686 


17.48 


0.97 


38.11 


0.0255 


0.30 


4 


26 


932 


25.91 


1.00 


35.85 


0.0278 


0.36 


5 


14 


540 


14.51 


1.04 


38.57 


0.0269 


0.34 


6 


33 


891 


24.76 


0.75 


27.00 


0.0278 


0.34 


7 


■ 32 


889 


23.97 


0.75 


27.78 


0.0270 


0.36 


10 


20 


587 


15.23 


0.76 


' 2935 


0.0259 


031 


13 


12 


562 


14.63 


1.22 


46.83 


0.0260 


0,32 


15 


20 


585 


15.14 


0.76 


29.25 


0.0259 


0.32 


17 


17 


678 


17.84 


1.05 


39.88 


0.0263 


033 


18 


9 


538 


13.94 


1.55 


59.78 


0.0259 


0.40 


19 


23 


609 


16.05 


0.70 


26.48 


0.0264 


0.33 


20 


19 


594 


15.96 


0.84 


31.26 


0.0269 


031 


22 


11 


553 


14.25 


1.30 


50.27 


0.0258 


039 


23 


18 


559 


14.77 


0.82 


31.06 


0.0264 


031 


25 


19 


708 


19.27 


1.01 


37.26 


0.0272 


0.36 


avg 


20.25 


67838 


18.00 


0.95 


36.08 


0.0265 


034 


std 


7.28 


144.96 


4.14 


0.23 


9.17 


0.0007 


0.03 


RSI Compared to M202 




1.25 


130 


131 


1.12 


1.11 


1 1.01 


0.94 


RSI Compared to 97-3 




1 0.72 


0.84 


0.86 


1.27 


1.24 


1 1.02 


0.94 
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While this initial study of RSI indicates variability between and within genotypes, 
some observations may be valid. First, that RSI Tl plants averaged greater seed weight per 
panicle than either control genotype. Second, that RSI Tl's averaged a greater number of 
seeds per panicle than either control genotype. This component of yield, seed number per 
5 panicle, is the largest positively affected parameter in the wheat transformation experiments 
that have been performed using Sh2-Rev6'HS. 



The foregoing detailed description has been given for clearness of understanding only 
and no unnecessary limitations should be understood therefrom as modifications will be 

1 0 obvious to those skilled in the art. While the invention has been described in connection with 
specific embodiments thereof, it will be understood that it is capable of fiuther modifications 
and this application is intended to cover any variations, uses, or adaptations of the invention 
following, in general, the principles of the invention and including such departures fi*om the 
present disclosure as come within known or customary practice within the art to which the 

1 5 invention pertains and as may be applied to the essential features hereinbefore set forth and 
as follows in the scope of the appended claims. 
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Claims 

I claim: 

1 1 . A method of increasing flie number of seeds produced by a plant, wherein the 

2 method comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS or an SH2RTS polypeptide; and 

1 3 b. growing the plant produced in step a, 

1 2. A method of increasing the biomass produced by a plant, wherein the method 

2 comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

1 2 encoding an SH2HS polypeptide; and 

1 3 b. growing the plant produced in step a. 



9' 
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1 3. A method of increasing the Harvest Index of a plant, wherein the method 

2 comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ ID NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

12 encoding an SH2HS polypeptide; and 

13 b. growing the plant produced in step a. 

1 4. The method of claim 1 , 2, or 3 wherein the plant is a monocotyledonous plant 

1 5 . The method of claim 4 wherein the plant is selected from the group consisting 

2 of rice, wheat, barley, oats, sorghum, and millet plant. 

1 6. The method of claim 1 , 2, or 3 wherein the plant is a dicotyledonous plant. 

1 7. The method of claim 6 wherein the plant is selected from the group consisting 

2 of pea, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, small 

3 bumet, soybean, and lettuce plant 

1 8. A method of increasing the flag leaf weight of a monocotyledonous plant, 

2 wherein the method comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 
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7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ JD NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

12 encoding an SH2HS polypeptide; and 

1 3 b. growing the plant produced in step a. 

1 9. A method of increasing the number of seed heads produced by a 

2 monocotyedonous plant, wherein the method comprises: 

3 a. introducing into a plant a nucleic acid operably linked to a promoter, 

4 wherein the nucleic acid is selected from the group consisting of a 

5 nucleic acid comprising SEQ JD NO: 3, a nucleic acid which 

6 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

7 encodes a polypeptide that retains biological activity of SH2-REV6- 

8 HS, a fragment of SEQ JD NO: 3 encoding a peptide that retains 

9 biological activity of SH2-REV6-HS, a nucleic acid encoding a 

10 polypeptide comprising SEQ JD NO: 4, or a fragment thereof that 

1 1 retains biological activity of SH2-REV6-HS, and a nucleic acid 

12 encoding an SH2HS polypeptide; and 

1 3 b. growing the plant produced in step a. 

1 1 0. A method of increasing two or more traits of a dicotyledonous plant, wherein 

2 the traits are selected from the group consisting of the number of seeds, average seed weight, 

3 total seed weight, number of seed heads, Harvest Index and total plant weight, wherein the 

4 method comprises: 

5 a. introducing into a plant a nucleic acid operably linked to a promoter, 

6 wherein the nucleic acid is selected from the group consisting of a 

7 nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 

8 hybridizes with SEQ ID NO: 3 under high stringency conditions and 

9 encodes a polypeptide that retains biological activity of SH2-REV6- 
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HS, a fragment of SEQ ED NO: 3 encoding a peptide that retains 
biological activity of SH2-REV6-HS, a nucleic acid encoding a 
polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 
retains biological activity of SH2-REV6-HS, and a nucleic acid 
encoding an SH2HS polypeptide; and 
b. growing the plant produced in step a. 

11. A method of increasing the yield of two or more traits of a monocotyledonous 
plant, wherein the traits are selected from the group consisting of the number of seeds, 
average seed weight, total seed weight, number of seed heads, flag leaf weight, Harvest Index 
and total plant weight, wherein the method comprises: 

a. introducing into a plant a nucleic acid operably linked to a promoter, 
wherein the nucleic acid is selected from the group consisting of a 
nucleic acid comprising SEQ ID NO: 3, a nucleic acid which 
hybridizes with SEQ ID NO: 3 under high stringency conditions and 
encodes a polypeptide that retains biological activity of SH2-REV6- 
HS, a fragment of SEQ K) NO: 3 encoding a peptide that retains 
biological activity of SH2-REV6-HS, a nucleic acid encoding a 
polypeptide comprising SEQ ID NO: 4, or a fragment thereof that 
retains biological activity of SH2-REV6-HS, and a nucleic acid 
encoding an SH2HS polypeptide; and 

b. growing the plant produced in step a. 

12. The method of claim 1, 2, 3, 8, 9, 10, or 1 1 fiirfher comprising crossing the 
plant obtained in step b with a second plant and harvesting and growing the seed which is 
produced as a result of making the cross. 

1 3 . The method of claim 1 , 2, 3, 8, 9, 1 0, or 1 1 ftirther comprising harvesting seed 
which is produced by selfmg the plant obtained in step b and growing the harvested seed. 



57 

1 14. The method of claim 8, 9, or 1 1 wherein the plant is selected jfrom the group 

2 consisting of rice, wheat, barley, oats, sorghum, and millet plant. 

1 15. The method of claim 1, 2, 3, 8. 9, 10, or 1 1 wherein the SH2HS polypeptide 

2 is selected from the group consisting of SH2HS13, SH2HS14, SH2HS16, SH2HS33, 

3 SH2HS39, SH2HS40, and SH2HS47, or a fragment of the SH2HS polypeptide that retains 

4 biological activity of the SH2HS polypeptide. 

1 16. The method of claim 1, 2, 3, 8, 9, 10, or 1 1 wherein the SH2RTS polypeptide 

2 is selected from the group consisting of SH2RTS48-2 and SH2RTS60- 1 , or a fragment of the 

3 SH2RTS polypeptide that retains biological activity of the SH2RTS polypeptide. 

1 17. The method of claim 10 wherein the plant is selected from the group 

2 consisting of pea, alfalfa, birdsfoot trefoil, chickpea, chicory, clover, kale, lentil, prairie grass, 

3 small bumet, soybean, and lettuce plant. 

1 18. A plant produced by the method of claim 1,2,3,8,9, 10, or 11. 

1 19. A plant comprising a nucleic acid encoding the amino acid sequence as set 

2 forth in SEQ ID NO: 4. 
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SEQUENCE LISTING 

<110> Giroux, Michael 

<120> Transgenic Plants with Increased Seed Yield, Biomass and Harvest Index 
<130> RDI-100 

<140> US 09/516,250 
<141> 2000-03-01 

<160> 6 

<170> PatentIn Ver. 2.1 

<210> 1 

<211> 1582 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (10) . . (1563) 

<223> Shrunken-2 gene revertant form 

<220> 

<221> variation 
<222> (267) 

<223> k = g or t; amino acid 86 = Ala. 
<220> 

<221> variation 
<222> (1368) 

<223> r = a or g; amino acid 453 « Pro. 
<220> 

<221> variation 
<222> (1578) 
<223> k = g or t. 

<400> 1 

ggaggagat atg cag ttt gca ctt gca ttg gac acg aac tea ggt cot cac 51 
Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His 
1 5 . 10 

cag ata aga tct tgt gag ggt gat ggg att gac agg ttg gaa aaa tta 99 
Gin lie Arg Ser Cys Glu Gly Asp Gly lie Asp Arg Leu Glu Lys Leu 
15 20 25 30 

agt att ggg ggc aga aag cag gag aaa get ttg aga aat agg tgc ttt 147 
Ser lie Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe 
35 40 45 

ggt ggt aga gtt get gca act aca caa tgt att ctt acc tea gat get 195 
Gly Gly Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala 
50 55 60 

tgt act gaa act ctt cat tct caa aca cag tec tct agg aaa aat tat 243 
Cys Pro Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr 
65 70 75 
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get gat aca aac cgt gta tct gck ate att ttg ggc gga ggc act gga 291 
Ala Asp Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly 
80 85 90 

tet cag etc ttt cct ctg aca age aca aga get acg cet get gta eet 339 
Ser Gin Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pre 
95 100 105 110 

gtt gga gga tgt tac agg ett att gat ate cct atg agt aac tge ttc 387 
Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe 
115 120 125 

aac agt ggt ata aat aag ata ttt gtg atg agt eag ttc aat tct act 435 
Asn Ser Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr 
130 135 140 

teg ctt aac cgc cat att cat cgt aca tac ctt gaa ggc ggg ate aac 483 
Ser Leu Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn 
145 150 155 . 

ttt get gat gga tet gta eag gta tta gcg get aca eaa atg cct gaa 531 
Phe Ala Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu - 
160 ' 165 170 

gag eca get gga tgg ttc eag ggt aca gca gae tct ate aga aaa ttt 579 
Glu Pro Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe 
175 180 185 190 

ate tgg gta etc gag gat tat tac agt cae aaa tee att gae aac att 627 
He Trp Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He 
195 200 205 

gta ate ttg agt ggc gat cag ctt tat egg atg aat tac atg gaa ctt 675 
Val He Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu 
210 • 215 220 

gtg eag aaa cat gtc gag gae gat get gat ate act ata tea tgt get 723 
Val Gin Lvs His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala 
225 230 235 

eet gtt gat gag age ega get tct aaa aat ggg eta gtg aag att gat 771 
Pro Val Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp 
240 245 250 

eat act gga egt gta ctt eaa ttc ttt gaa aaa eca aag ggt get gat 819 
His Thr Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp 
255 260 • 265 270 

ttg aat tct atg aga gtt gag ace aac ttc ctg age tat get ata gat 867 
Leu Asn Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp 
275 280 285 

gat gca eag aaa tat eca tac ctt gca tea atg ggc att tat gtc ttc 915 
Asp Ala Gin Lys Tyr Pro Tyr Leu A.la Ser Met Gly He Tyr Val Phe 
290 295 300 

aag aaa gat gca ett tta gae ett etc aag tea aaa tat act eaa tta 963 
Lys Lys Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu 
305 310 315 
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cat gac ttt gga tct gaa ate etc cca aga get gta eta gat cat agt 

His Asp Phe Gly Ser Glu lie Leu Pro Arg Ala Val Leu Asp His Ser 
320 325 330 

gtg cag gca tgc att ttt acg ggc tat tgg gag gat gtt gga aca ate 

Val Gin Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He 

335 340 345 350 



ttg cet ccg acg caa ttg gae aag tgc aag atg aaa tat gca ttt ate 
Leu Pro Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe He 
385 390 395 



gga gtc tgc tea egt gtc age tct gga tgt gaa etc aag gac tee gtg 
Gly Val Cys Ser Arg Val Ser Ser Gly Cys Glu Leu Lys Asp Ser Val 
415 420 425 430 



tac teg tac tac ata agg tct gga ate gtg gtg ate ctg aag aat gca 
Tyr Ser Tyr Tyr He Arg Ser Gly He Val Val He Leu Lys Asn Ala 
495 500 505 510 



1011 



1059 



aaa tea ttc ttt gat gca aac ttg gee etc act gag cag cct tec aag 1107 

Lys Ser Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys 
355 360 365 

ttt gat ttt tac gat cca aaa aca cct ttc ttc act gca ccc cga tgc 1155 

Phe Asp Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys 

370 375 • 380 



1203 



tea gat ggt tgc tta ctg aga gaa tgc aac ate gag eat tct gtg att 1251 
Ser Asp Gly Cys Leu Leu Arg Glu Cys Asn He Glu His Ser Val He 
400 405 410 



1299 



atg atg gga gcg gac ate tat gaa act gaa gaa gaa get tea aag eta 1347 
Met Met Gly Ala Aso He Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu 
435 440 445 

ctg tta get ggg aag gtc ccr gtt gga ata gga agg aac aca aag ata 1395 
Leu Leu Ala Gly Lys Val Xaa Val Gly He Gly Arg Asn Thr Lys He 
450 455 460 

agg aac tgt ate att gac atg aat get agg att ggg aag aac gtg gtg 1443 
Arg Asn Cys He He Asp Met Asn Ala Arg He Gly Lys Asn Val Val 
465 470 475 

ate aca aac agt aag ggc ate caa gag get gat cac ccg gaa gaa ggg 1491 
He Thr Asn Ser Lys Gly He Gin Glu Ala Asp His Pro Glu Glu Gly 
480 485 490 



1539 



ace ate aac gat ggg tct gtc ata tagatcggct gcgtktgcg 1582 
Thr He Asn Asp Gly Ser Val He 
515 



<210> 2 
<211> 518 
<212> PRT 
<213> Zea mays 

<220> 
<221> - 
<222> (86) 

<223> Xaa = Ala, from degeneracy at position 267 in DNA sequence (k = g or t) . 
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<220> 
<221> - 
<222> (453) 

<223> Xaa = Pro, from degeneracy at position 1368 in DMA sequence (r = a or 
g) . 

<400> 2 

Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His Gin He 
1 5 10- 15 

Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu Ser He 
20 25 30 

Glv Glv Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe Gly Gly 
35 40 45 

Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala Cys Pro 
50 55 60 

Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr Ala Asp 
65 70 75 80 

Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly Ser Gin 
85 90 95 

Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro Val Gly 
100 105 110 

Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe Asn Ser 
115 120 125 

Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr Ser Leu 
130 135 140 

Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn Phe Ala 
145 150 155 160 

Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu Glu Pro 
165 170 175 

Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe He Trp 
180 185 190 

Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He Val He 
195 200 205 

Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu Val Gin 
210 215 220 

Lys His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala Pro Val 
225 230 235 240 

Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp His Thr 
245 250 255 

Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp Leu Asn 
260 265 270 

Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp Asp Ala 
275 280 285 
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Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe Lys Lys 
290 295 300 

Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu His Asp 
305 310 315 320 

Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp His Ser Val Gin 

325 330 335 

Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He Lys Ser 
340 345 350 

Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys Phe Asp 
355 360 365 

Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys Leu Pro 
370 375 380 

Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe He Ser Asp 
385 390 395 400 

Gly Cys Leu Leu Arg Glu Cys Asn He Glu His Ser Val He Gly Val 
405 410 415 

Cys Ser Arg Val Ser Ser Gly Cys Glu Leu Lys Asp Ser Val Met Met 
420 425 430 

Gly Ala Asp He Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu Leu Leu 
435 440 445 

Ala Gly Lys Val Xaa Val Gly He Gly Arg Asn Thr Lys He Arg Asn 
450 455 460 

Cys He He Asp Met Asn Ala Arg He Gly Lys Asn Val Val He Thr 
465 470 475 480 

Asn Ser Lys Gly He Gin Glu Ala Asp His Pro Glu Glu Gly Tyr Ser 

485 490 495 

Tyr Tyr He Arg Ser Gly He Val Val He Leu Lys A.sn Ala Thr He 
500 505 510 



Asn Asp Gly Ser Val He 
515 



<210> 3 
<211> 1582 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (10) . . (1563) 

<223> Shrunl<:en-2 gene revertant form, modified to be 
heat stable 

<220> 

<221> variation 
<222> (267) 

<223> k = g or t; amino acid 86 = Ala. 
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<220> 

<221> variation 
<222> (1008) 
<223> y c or t. 

<220> 

<221> variation 
<222> (1368) 

<223> r = a or g; amino acid 453 = Pro. 

<220> 

<221> variation 
<222> (1578) 
<223> )c = g or t. 

<400> 3 

ggaggagat atg cag tttgca ctt gca ttg gac acg aac tea ggt cct cac 51 
Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His 
15 10 

cag ata aga tct tgt gag ggt gat ggg att gac agg ttg gaa aaa tta 99 
Gin He Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu 
15 20 25 30 

agt att ggg ggc aga aag cag gag aaa get ttg aga aat agg tgc ttt 147 
Ser He Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe 
35 40 45 

ggt ggt aga gtt get gca act aca caa tgt att ctt acc tea gat get 195 
Gly Gly Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala 
50 55 60 

tgt cct gaa act ctt cat tct caa aca cag tec tct agg aaa aat tat 243 
Cys Pro Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr 
65 70 75 

get gat gca aac cgt gta tct gclc ate att ttg ggc gga ggc act gga 291 
Ala Asp Ala Asn Arg Val Ser Xaa He He Leu Gly Gly Gly Thr Gly 
80 * 85 90 

tct cag etc ttt cct ctg aca age aca aga get acg cct get gta cct 339 
Ser Gin Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro 
95 100 105 110 

gtt gga gga tgt tac agg ctt att gat ate cct atg agt aac tgc ttc 387 
Val Gly Gly Cys Tyr Arg Leu He Asp He Pro Met Ser Asn Cys Phe 
115 120 125 

aac agt ggt ata aat aag ata ttt gtg atg agt. cag ttc aat tct act 435 
Asn Ser Gly He Asn Lys He Phe Val Met Ser Gin Phe Asn Ser Thr 
130 135 140 

teg ctt aac cgc cat att cat cgt aca tac ctt gaa ggc ggg ate aac 483 
Ser Leu Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn 
145 150 155 

ttt get gat gga tct gta cag gta tta gcg get aca caa atg cct gaa 531 
Phe Ala Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu 
160 165 170 
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gag cca get gga tgg ttc cag ggt aca gca gac tct ate aga aaa ttt 
Glu Pro Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe 
175 180 185 190 



579 



ate tgg gta etc gag gat tat tac agt cac aaa tec att gac aac att 627 
He Trp Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He 
195 200 205 

gta ate ttg agt ggc gat cag ctt tat egg atg aat tac atg gaa ctt 675 
Val He Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu 
210 215 220 

gtg cag aaa cat gtc gag gac gat get gat ate act ata tea tgt get 723 
Val Gin Lys His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala 
225 230 235 

cet gtt gat gag age ega get tct aaa aat ggg eta gtg aag att gat 771 
Pro Val Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp 
240 245 250 

cat act gga cgt gta ctt caa ttc ttt gaa aaa cca aag ggt get gat 819 
His Thr Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp 
255 260 265 270 

ttg aat tct atg aga gtt gag ace aac ttc ctg age tat get ata gat 867 
Leu Asn Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp 
275 280 285 

gat gca cag aaa tat cca tac ctt gca tea atg ggc att tat gtc ttc 915 
Asp Ala Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe 
290 295 300 

aag aaa gat gca ctt tta gac ctt etc aag tea aaa tat act caa tta 963 
Lys Lys Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu 
305 310 315 

cat gac ttt gga tct gaa ate etc cca aga get gta eta gat tay agt 1011 
His Asp Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp Tyr Ser 
320 325 330 

gtg cag gca tge att ttt acg ggc tat tgg gag gat gtt gga aca ate 1059 
Val Gin Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He 
335 340 345 350 

aaa tea ttc ttt gat gca aac ttg gee etc act gag cag ect tec aag 1107 
Lys Ser Phe Phe Asp Ala Asn Leu Ala Leu Thr Glu Gin Pro Ser Lys 
355 360 365 

ttt gat ttt tac gat cca aaa aca ect ttc ttc act gca ecc ega tge 1155 
Phe Asp Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys 
370 375 380 

ttg ect ccg acg caa ttg gac aag tge aag atg aaa tat gca ttt ate 1203 
Leu Pro Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe He 
385 390 395 

tea gat ggt tge tta ctg aga gaa tge aac ate gag cat tct gtg att 1251 
Ser Asp Gly Cys Leu Leu Arg Glu Cys Asn He Glu His Ser Val He 
400 405 410 
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gga gtc tgc tea cgt gtc age tct gga tgt gaa ete aag gae tee gtg 

Gly Val Cys Ser Arg Val Ser Ser Gly Cys Giu Leu Lys Asp Ser Val 

415 420 425 430 



1299 



atg atg gga gcg gae ate tat gaa act gaa gaa gaa get tea aag eta 1347 
Met Met Gly Ala Asp lie Tyr Glu Thr Glu Glu Glu Ala Ser Lys Leu 
435 440 445 

ctg tta get ggg aag gtc ccr gtt gga ata gga agg aac aea aag ata 1395 
Leu Leu Ala Gly Lys Val Xaa Val Gly lie Gly Arg Asn Thr Lys He 
450 455 460 

agg aac tgt ate att • gae atg aat get agg att ggg aag aac gtg gtg 1443 
Arg Asn Cvs lie He Asp Met Asn Ala Arg He Gly Lys Asn Val Val 
465 470 475 

ate aea aae agt aag gge ate eaa gag get gat eae eeg gaa gaa ggg 1491 
He Thr Asn Ser Lys Gly He Gin Glu Ala Asp His Pro Glu Glu Gly 
480 485 490 

tac teg tae tac ata agg tet gga ate gtg gtg ate ctg aag aat gea 1539 
Tyr Ser Tyr Tyr He Arg Ser Gly He Val Val He Leu Lys Asn Ala 
495 500 505 510 

aee ate aae gat ggg tet gtc ata tagategget gcgtktgeg 1582 
Thr He Asn Asp Gly Ser Val He 
515 



<210> 4 
<211> 518 
<212> PRT 
<213> Zea mays 

<220> 
<221> - 
<222> (86) - 

<223> Xaa = Ala, from degeneracy at position 267 in DNA sequence (k = g or t) 

<220> 
<221> - 
<222> (453) 

<223> Xaa = Pro, from degeneracy at position 1368 in DNA sequence (r = a or 

g) . 

<400> 4 

Met Gin Phe Ala Leu Ala Leu Asp Thr Asn Ser Gly Pro His Gin He 

15 10 15 

Arg Ser Cys Glu Gly Asp Gly He Asp Arg Leu Glu Lys Leu Ser He 
20 25 30 

Gly Gly Arg Lys Gin Glu Lys Ala Leu Arg Asn Arg Cys Phe Gly Gly 
35 40 45 

Arg Val Ala Ala Thr Thr Gin Cys He Leu Thr Ser Asp Ala Cys Pro 
50 55 60 

Glu Thr Leu His Ser Gin Thr Gin Ser Ser Arg Lys Asn Tyr Ala Asp 
65 70 75 80 
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Ala Asn Arg Val Ser Xaa lie He Leu Gly Gly Gly Thr Gly Ser Gin 
85 90 95 

Leu Phe Pro Leu Thr Ser Thr Arg Ala Thr Pro Ala Val Pro Val Gly 
100 105 110 

Gly Cys Tyr Arg Leu He Asp He Pro ^4et Ser Asn Cys Phe Asn Ser 
115 120 125 

Gly He Asn Lvs He Phe Val Met Ser Gin Phe Asn Ser Thr Ser Leu 
130 " 135 140 

Asn Arg His He His Arg Thr Tyr Leu Glu Gly Gly He Asn Phe Ala 

145 150 155 160 

Asp Gly Ser Val Gin Val Leu Ala Ala Thr Gin Met Pro Glu Glu Pro 
165 170 175 

Ala Gly Trp Phe Gin Gly Thr Ala Asp Ser He Arg Lys Phe He Trp 
180 185 190 

Val Leu Glu Asp Tyr Tyr Ser His Lys Ser He Asp Asn He Val He 
195 200 205 

Leu Ser Gly Asp Gin Leu Tyr Arg Met Asn Tyr Met Glu Leu Val Gin 
210 215 220 

Lys His Val Glu Asp Asp Ala Asp He Thr He Ser Cys Ala Pro Val 

225 230 235 240 

Asp Glu Ser Arg Ala Ser Lys Asn Gly Leu Val Lys He Asp His Thr 
245 250 255 

Gly Arg Val Leu Gin Phe Phe Glu Lys Pro Lys Gly Ala Asp Leu Asn 
260 265 270 

Ser Met Arg Val Glu Thr Asn Phe Leu Ser Tyr Ala He Asp Asp Ala 
275 280 285 

Gin Lys Tyr Pro Tyr Leu Ala Ser Met Gly He Tyr Val Phe Lys Lys 
290 295 300 

Asp Ala Leu Leu Asp Leu Leu Lys Ser Lys Tyr Thr Gin Leu His Asp 
305 310 315 320 

Phe Gly Ser Glu He Leu Pro Arg Ala Val Leu Asp Tyr Ser Val Gin 
325 330 335 

Ala Cys He Phe Thr Gly Tyr Trp Glu Asp Val Gly Thr He Lys Ser 

340 345 ■ 350 

Phe Phe Asp Ala Asn Leu -Ala Leu Thr Glu Gin Pro Ser Lys Phe Asp 
355 360 . 365 



Phe Tyr Asp Pro Lys Thr Pro Phe Phe Thr Ala Pro Arg Cys Leu Pro 

370 375 380 

Pro Thr Gin Leu Asp Lys Cys Lys Met Lys Tyr Ala Phe He Ser Asp 

385 390 395 400 

Gly Cys Leu Leu Arg Glu Cys Asn He Glu His Ser Val He Gly Val 

405 410 415 
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Cys Ser Arg Val Ser Ser Gly Cys 
420 

Gly Ala Asp He Tyr Glu Thr Glu 
435 440 

Ala Gly Lys Val Xaa Val Gly He 
450 455 

Cys He He Asp Met Asn Ala Arg 
465 470 

Asn Ser Lys Gly He Gin Glu Ala 
485 

Tyr Tyr He Arg Ser Gly He Val 
500 

Asn Asp Gly Ser Val He 
• 515 



10 



Glu Leu Lys Asp Ser Val Met Met 
425 430 

Glu Glu Ala Ser Lys Leu .Leu Leu 
445 

Gly Arg Asn Thr Lys He Arg Asn 
460 

He Gly Lys Asn Val Val He Thr 
475 480 

Asp His Pro Glu Glu Gly Tyr Ser 
490 495 

Val He Leu Lys Asn Ala Thr He 
505 510 



<210> 5 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PGR primer 

<400> 5 

ctggatgtga actcaaggac tccgtg 



<210> 6 
<2H> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 



<400> 6 

ggcttaacta tgcggcatca gage 



24 



